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STRUCTURE FORMELLE DES TEXTES ET COMMUNICATION 
Deux ErupEs PAR 
Benoit MANDELBROT! 


Sommaire: Introduction, p. 1. Sur le caractére discret des signes linguis- 
tiques, p. 3. Sur les mots comme unités concrétes de tout texte (Généralisa- 
tion et explication de la loi statistique d’Estoup et de Zipf), p. 9. Bibliogra- 
phie, p. 26. Les renvois & la bibliographie comportent le nom de |’auteur et la 
date du travail cité. English summary, p. 27. 


I. INTRODUCTION? 


Les ‘‘textes”’, dont nous nous proposons d’étudier certaines propriétés structu- 
rales, seront par définition des échantillons de comportement linguistique. 
Chaque texte sera df 4 un seul sujet, et sera enregistré pendant un temps assez 
court pour que l’auteur n’ait pas changé pendant qu’il l’émettait, et assez long 
pour bien représenter les possibilités d’expression de cet auteur. Une hypothése 
fondamentale, implicite dans toutes les sciences du langage, est que ces deux 
limites de temps sont trés différentes l’une de |’autre, et que, par suite, on ne 
risque pas de commettre des erreurs trés considérables en estimant 4 priori de 
fagon approximative si un échantillon donné tombe bien entre ces limites, et 
doit étre considéré comme texte. 

Les objets d’étude de la linguistique et des sciences connexes sont les textes 
eux-mémes, des éléments dont la juxtaposition dans le temps peut constituer un 
texte, des ensembles de textes d’une méme langue, c’est-4-dire dus 4 des sujets 
se comprenant mutuellement, ou encore des ensembles de textes plus généraux. 

Un des aspects de tout texte est un “aspect pragmatique”’: il est successive- 
ment constitué par plusieurs objets physiques, dont certains sont manipulés par 


1 Les recherches mathématiques, dont ceci est un compte rendu 4 l’usage des linguistes, 
ont été effectués aux Laboratoires d’Electronique et de Physique Appliquées, rue du Re- 
trait, Paris, France, et au Research Laboratory of Electronics, Massachusetts Institute 
of Technology, Cambridge 39, Mass., U.S.A. The research performed at M.I.T. was sup- 
ported in part by the Signal Corps, the Air Materiel Command, and the Office of Naval 
Research of the United States of America. De nombreux savants frangais et américains 
ont examiné et critiqué plusieurs versions précédentes de ce travail, échelonnées sur deux 
années, sous des titres variés. Nous leur en exprimons ici notre vive reconnaissance. Nous 
sommes, tout particulitrement, redevable 4 Mme Jeanne Martinet de suggestions aussi 
nombreuses que détaillées, que nous nous sommes efforcé d’incorporer dans la rédaction 
finale. 

? Nous recommandons au lecteur de revenir 4 cette introduction aprés avoir achevé la 
lecture de chacune des deux études qui la suivent. 
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des organes physiologiques; ces objets ont pu étre émis, transmis puis recus et 
identifiés, et quelquefois répétés. Le postulat fondamental de la linguistique est 
qu’un texte posséde également un aspect de “message”, suite de symboles dis- 
crets sur plusieurs niveaux hiérarchisés: phonémes, mots, etc. . . . La linguistique 
se propose d’établir des catalogues de certains de ces symboles, puis de dégager 
des régles, aussi simples et peu nombreuses que possible, qui permettent de 
distinguer, de fagon aussi précise que possible, les combinaisons de symboles que 
lon trouve effectivement dans des textes, des combinaisons que |’on ne trouve 
jamais, enfin de comparer les symboles et les régles relatifs 4 des groupes indépen.-- 
dants de textes. I] semble assez généralement admis que toute référence & l’aspect 
pragmatique du signe peut étre éliminée de la linguistique proprement dite, si 
celle-ci est limitée 4 |’étude de certaines “structures” (dont il est d’ailleurs 
difficile de donner a priori une définition générale). 

Cette croyance dans |’indépendance de la linguistique et de la physique et de 
la physiologie semble étre basée uniquement sur |’expérience que l’on a des 
organes de la parole: la forme et les dimensions de ceux-ci limitent bien entendu 
la variété des signes que l’on peut émettre, mais il parait superficiellement exact 
que les signes qui restent possibles restent compatibles avec pratiquement n’im- 
porte quelle structure. Dans les études qui suivent, nous nous proposons de mon- 
trer que cetie expérience que l’on a des organes de la parole ne peut pas étre générali- 
sée, et que, tout au contraire, il n'est pas légitime d’éliminer a priori de la linguistique 
toute référence aux conditions physiques et physiologiques de la commv ‘ication. 

Plus précisément, nous montrerons, dans une premiére étude, que l’exristence 
méme d’un aspect “message” de tout texte, distinct de son aspect d’objet phy- 
sique, est une conséquence directe et inévitable de certaines lois physiques de la 
communication. On pourra aussi exprimer ce fait en disant que les conditions 
physiques de la communication entrainent que les symboles linguistiques doivent 
satisfaire 4 des ‘“‘principes structuraux”’: ils sont discrets (‘‘différentiels’’), seg- 
mentés, linéaires, “arbitraires”, etc. ... En d’autres termes encore, si l’on pou- 
vait reconstruire la langue en partant de zéro, on serait obligé de lui attribuer 
les propriétés exprimées par les principes qui viennent d’étre énumérés. 

Dans la deuxiéme étude, nous dégagerons un autre point de vue selon lequel 
les textes sont également tous “‘trés bien faits’’. Nous considérerons tout texte 
comme suite de formes vides, ayant les propriétés étudiées dans la premiére 
partie, toute question de sens et de grammaire étant négligée. Du point de vue 
de |’économie de la transmission et de la réception de ces formes vides, tout ce 
qui compte, c’est leur statistique. Nous verrons quelle statistique on “devrait” 
attribuer aux formes structuralement les plus importantes de la langue, & ses 
“unités concrétes’”’, si, encore une fois, en reconstruisait tout & partir de zéro. 
Cette statistique se revélera étre une généralisation d’une loi empirique d’Extoup 
et de Zipf: elle peut prendre des formes différentes suivant les textes, mais doit 
étre bien déterminée pour chaque texte. Les données de Zipf prouvent que les mots 
suivent cette statistique: il semble donc que ce sont la les unités concrétes, et que 
tor:t texte est bien construit au moyen de ces unités. Nous examinerons aussi d’autres 
propriétés numériques des textes, que nous rattacherons 4 la statistique des 
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mots. Nous dirons que la “‘loi de Zipf”’, généralisée, ainsi que les autres propriétés 
en question, sont “expliquées” par le fait que ce sont les ‘‘meilleures’’ propriétés 
qu’un texte puisse avoir, d’un certain point de vue purement pragmatique, limité 
mais non négligeable. 

Notre concept d’explication est voisin de celui de la physique. On dit qu’on 
a expliqué un fait physique, dont la description est compliquée, ou |’aspect 
inattendu, lorsqu’on a montré que ce fait est une conséquence, logiquement 
nécessaire, d’autres faits qui sont plus simples ou plus familiers, mais que l’on 
ne doit pas considérer comme étant des “‘causes premiéres”’ des faits étudiés. 
De méme, s’il est important de savoir que tout texte “‘obéit”’ & certains critéres 
pragmatiques de qualité, zl n’en résulte nullement qu’il ait effectivement été cons- 
truit pour obéir a ces critéres. Il nous semble que renoncer & la recherche des 
intentions en linguistique serait aussi désirable que de renoncer & la recherche 
des causes en physique. Toutefois, il est incontestable que l’exposé de nos rai- 
sonnements sera plus intuitif pour les linguistes si nous nous placgons du point 
de vue finaliste que la structure de toute texte accomplit certaines intentions, 
et c’est ce point de vue que nous allons, a4 regret, adopter. 

Signalons d’avance le caractére ‘“‘algébrique” que prend nécessairement une 
méthode d’explication par des critéres que l’on pose en quelque sorte comme 
axiomes. Au point de départ, tout ce que |’on suppose au sujet d’un texte est 
que c’est une suite de signes, organisés systématiquement pour permettre 4 des 
sujets de communiquer. I] est impossible de vérifier directement si les axiomes 
que nous posons sont satisfaits par les textes réels, donc on essaye de vérifier 
la chose indirectement, en cherchant des propriétés équivalentes aux axiomes, 
mais observables (et en général moins simples). Pour arriver 4 de telles pro- 
priétés, il faut raisonner sur des textes satisfaisant 4 des axiomes non vérifiables 
directement, c’est-A-dire raisonner sur des objets encore indéterminés. 

Signalons enfin que nos propriétés axiomatiques n’auront pas |’apparence 
“quantitative”, & laquelle on peut s’attendre & cause de l’analogie signalée avec 
la physique. Mais leur apparence ‘‘qualitative” est en fait une traduction de 
leur caractére structural. Les conséquences observables des axiomes s’étendront 
& tout le systéme de signes, tandis que |’on sait que |’influence de certaines 
données physiques quantitatives est limitée. C’est 14 un trait essentiel de cer- 
taines méthodes mathématiques et physiques assez fines, que de permettre de 
déduire des propriétés numériques qui doivent étre partout satisfaites, en par- 
tant de données non pas numériques, mais structurales. En somme, ce que nous 
faisons Jott étre encore considéré comme de la lingutstique structurale, précisément 
parce que les faits pragmatiques que nous considérerons seront eux-mémes struc- 
turauz. 


II. SUR LE CARACTERE DISCRET DES SIGNES LINGUISTIQUES 


Les propriétés des textes considérés dans cette premiére étude se rattachent 
toutes au “principe structural’? que tout texte peui étre considéré comme porteur 
d’un message, composé de signes linguistiques discrets, ou, en d’autres termes, 
différentiels ou discontinus ou digitaux. Ce dernier terme évoque l’image de 
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“compter sur les doigts’’, et ils signifient tous que ]’on peut établir un catalogue 
des signes et les numéroter tous 4 l’aide de nombres entiers (d’une infinité de 
fagons différentes). Deux signes discrets sont, ou bien tout & fait identiques, ou 
bien tout 4 fait differents. Entre deux signes différents de ‘‘valeurs” données, 
en peut avoir des signes de valeurs intermédiaires, mais jamais une gamme 
continue de signes de valeurs intermédiaires. 

Ce qui précéde constitue un principe parfaitement tranché et catégorique, 
c’est-a-dire qui décide que les signes linguistiques appartiennent tous & |’une 
de deux catégories de signes bien déterminées a l’avance. Mais pourquoi ce prin- 
cipe est satisfait n’est nullement évident. En fait, certains linguistes songent a 
Vabandonner pour généraliser |’objet de leur science: ils considérent toute suite 
de signes organisés comme un texte, sans inclure dans la définition le fait de 
comporter un message discret. 

On peut cependant démontrer le caractére privilégié des signes discrets, si 
Yon utilise la théorie statistique de la communication de |’information, due 
surtout 4 Claude Shannon (1948) et 4 Norbert Wiener (1948). Cette théorie 
permet de montrer que le fait que les signes sont discrets est équivalent au fait qu’on 
peut, moyennant quelques soins, les relayer indéfiniment de sujet a sujet, sans 
déformer le message. Nous allons expliquer et motiver cette derniére propriété et 
la prendre alors comme “explication” du caractére discret des signes. 

Tout d’abord, tous les objets physiques susceptibles d’étre utilisés comme 
supports de message sont continus, c’est-A-dire, qu’ils peuvent étre modifiés par 
degrés infiniment petits. (Par exemple, un son est une surpression, localisée dans 
une petite zone de |’air ambiant. L’amplitude et la durée de cette surpression 
peuvent en principe prendre n’importe quelles valeurs.) Cependant des “‘bruzts’’ 
s’ajoutent toujours au signal: il est impossible de mesurer avec une précision 
infinie un signal déja émis, ni de contréler de facgon parfaite l’amplitude et la 
dur‘e d’un signal que |’on veut émettre. On commet toujours, et dans les deux 
cas, des erreurs, en général petites, mais quelquefois considérables. Le point 
essentiel est que les erreurs ne sont pas seulement dues 4 des imperfections élimi- 
nables des instruments qu’on utilise: il existe des lois de la physique qui déter- 
minent les valeurs minima des erreurs de mesure et de contréle. Ces valeurs 
minima, liées aux fluctuations, sont bien sir trés petites, mais dans ce qui suit, 
une erreur trés petite sera aussi néfaste qu’une grosse erreur, car les erreurs s’accu- 
mulent. La différence cruciale se trouvera entre le cas ov |’erreur est nulle, et 
tous ceux ov elle n’est pas nulle. 

Pour le montrer, remarquons que pour communiquer d’un sujet 4 un autre, 
au moyen d’un objet physique continu, on peut utiliser deux procédés, radicale- 
ment différents l’un de l’autre. 

A) On peut supposer qu’d chaque forme du support physique du texte correspond 
un message différent. Or, deux textes physiquement voisins peuvent étre con- 
fondus l’un avec l’autre 4 la réception ou 4 |’émission. I] faut donc introduire 
un concept de ‘‘degré de différence’’ entre deux messages: deux messages dont on 
confond souvent les supports doivent étre “peu différents” si on veut éviter les 
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difficultés; deux messages tout 4 fait differents doivent correspondre 4 des sup- 
ports qu’on ne confond jamais. De tels messages seront dits continus. 

B) Supposons au contraire que le message est discret, c’est-A-dire qu’il n’y a 
pas de degrés de différence, mais seulement différence ou identité. Il faut et il suffit 
alors que des textes différents aient des supports physiques qu’on ne peut jamais 
prendre I’un pour I|’autre. A part cela, ces supports physiques n’ont aucun besoin 
d’étre définis avec précision. 


UN EXEMPLE DE COMMUNICATION NAUTIQUE 


Pour étudier en détail les conséquences du choix entre les deux alterna- 
tives ci-dessus, en ce qui concerne le relais des messages, prenons |’exemple 
concret d’une flotille de bateaux, dans lequel le choix du genre de message pour- 
rait se présenter effectivement. Les messages consistent en des commandements 
& transmettre d’un bateau a |’autre, donnant la direction & prendre par le bateau 
récepteur. Le chef de flottille ne peut pas communiquer directement avec tous 
les bateaux; ses commandements doivent donc étre quelquefois relayés. Pour 
relayer, on fait une copie du texte regu, et, ou bien on garde la copie et on envoie 
original, ou bien on garde |’original et on envoie la copie. Comme nous cher- 
chons une simplification de ce qui se passe avec les signaux acoustiques ou 
visuels, qui sont détruits par la réception, nous allons supposer que chaque rela- 
yeur garde l’original et envoie la copie qu’il vient de faire de cet original. 

Quelle sera la nature du message dans le cas continu? Ce sera un angle quel- 
conque entre 0° et 360°, par exemple tracé sur une carte. L’émetteur commet 
une erreur en ]’émettant et le récepteur y ajoute une erreur de lecture. Ces deux 
erreurs ne sont jamais nulles. Ceci est sans inconvénient s’il n’y a qu’un seul 
émetteur et un seul récepteur, car l’erreur dans le contréle du bateau n’est pas 
nulle non plus, et il suffit que l’erreur dans le commandement soit inférieure 4 
erreur dans |’exécution. Mais que se passe-t-il si le récepteur doit ensuite re- 
transmettre l’ordre? I] n’a aucun moyen de savoir si la valeur de |’angle qu’il 
croit lire sur la carte est bien celle qu’on voulait lui envoyer, ou si elle contient 
des erreurs introduites en cours de route. Par suite, le plus sir, pour chaque 
relayeur, est de reproduire ce qu’il crvit lire. Ainsi, chaque relais ajoute une 
nouvelle erreur de reproduction et une nouvelle erreur de lecture, et, quelque 
petite que soit chaque erreur individuelle, l’erreur totale augmente chaque fois, 
et finira par dépasser |’erreur dans la manoeuvre. Autrement dit, aprés un cer- 
tain nombre de relais, le commandement n’apprend plus rien au récepteur ou, 
si le commandement revenait 4 |’émetteur, celui-ci ne le reconnaitrait pas, et 
ceci quelque soigneux qu’aient été les relayeurs. 

Si on se décidait au contraire pour un message discret, on commencerait par 
choisir un angle unité de l’ordre de grandeur le |’erreur admissible dans la ma- 
noeuvre. Dés lors, on n’enverrait plus d’angle quelconque, mais seulement des 
angles d’abord “‘‘arrondis”’ & l’angle le plus proche qui est un multiple entier de 
Vangle unité. Si le récepteur croit lire sur la carte un angle qui n’est pas un 
multiple entier de l’unité convenue, il sait parfaitement bien qu’il y a eu erreur 
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quelque part. I] ne retransmettra pas ce qu’il croit lire, mais ]’angle admissible 
le plus voisin. En somme, chaque relais permet de corriger l’erreur de trans- 
mission du relais précédent. Donc, avec un peu de soin, on peut relayer un nombre 
infini de fois sans que l’e: -eur totale augmente au dela de l’erreur commise au début 
en arrondissant la direction a prendre. Donc, en fin de compte, |’ “arrondissement” 
initial, qui semblait introduire une perte de précision dans la manoeuvre, se révéle 
étre zndispensable, si on veut que le message soit “‘stable en présence de bruit’’. 
D’ailleurs, dés qu’on a décidé d’utiliser des messages de commandement discrets 
(comme on le fait en pratique dans notre exemple nautique), on ne va plus 
utiliser un angle marqué sur une carte, mais retransmettre le nombre mesu- 
rant cet angle. Ceci souléve le probléme de l’arbitraire du signal portant un 
message discret sur lequel nous allons revenir. 

Remarquons le réle spécial joué dans le raisonnement précédent par le ‘‘temps”’ 
qui s’introduit 4 travers |’ordre de consécutivité des relais. Si le message est 
continu, il change avec le temps, de fagon absolument désordonnée et incon- 
trélable. Mais si le message est discret, il ne change absolument pas avec le temps: 
il est a-chronique. Donc, nous n’avons considéré le temps que pour en annuler 
immédiatement |’influence. Le seul effet diachronique qui soit permissible avec 
des messages discrets est une “‘diachronie externe”’, comme le passage d’un angle 
entier tracé sur une carte 4 un nombre transmis par un autre moyen quelconque, 
par exemple par radio. 

Les considérations qui précédent sont absolument générales. On doit par 
exemple les utiliser pour transmettre par relais 4 longue distance des signaux 
qui n’étaient pas destinés 4 étre relayés, et qui par suite étaient continus. Ces 
signaux doivent étre artificiellement rendus discrets. Le procédé est appelé 
“modulation par impulsions codées’’ (PCM); il n’est pas encore utilisé com- 
mercialement. 


POSITION DU PROBLEME LINGUISTIQUE 


La situation dans la communication linguistique est un peu plus générale que 
celle de la communication nautique que nous venons de considérer. En effet, le 
temps n’intervient plus seulement par |’ordre de consécutivité des relais, mais 
par le fait que tout texte-objet physique a deux dimensions: amplitude de la 
déformation acoustique et sa durée, au lieu de la seule dimension: angle. Toute- 
fois, la conclusion n’est pas changée, sauf qu’elle doit maintenant s’appliquer 4 
deux dimensions au lieu d’une. Pour qu’un message porté par une onde acoustique 
puisse étre relayé, et en particulier enseigné, il faut qu’il soit segmenté en morceaux 
discrets. 

Ces segments se succédent de fagon linéaire, sur un “temps” qui n’est qu’un 
nouvel ordre de consécutivité, différent du temps de l’onde acoustique, et con- 
ceptuellement différent de l’ordre de consécutivité des relais. Nous reviendrons 
sur la linéarité du temps, p. 13 (note 4). Pour le moment, commentons le 
raisonnement que nous venons de faire. 

Tout d’abord, au sujet de l’attitude du relais envers le signal relayé, on peut 
analyser en termes de communication un trés intéressant chapitre du livre de 
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K. Goldstein (1948). Les sujets sont des aphasiques qui ont 4 relayer un “‘texte”’, 
qui est une suite de phonémes de toute sorte mélangés. Ces sujets adoptent l’une 
de deux attitudes. La premiére, que Goldstein appelle “imitative”, consiste & 
considérer le texte comme une onde continue, c’est-A-dire non linguistique, qu’il 
faut reproduire aussi fidélement que possible, sans aucune ‘“compréhension” ou 
“interprétation”’. D’autres sujets n’“imitent” pas, mais, suivant la terminologie 
de Goldstein, ils “‘répétent”’: ceci consiste &4 commencer par essayer de “com- 
prendre’’, c’est-a-dire d’identifier le message qu’ils entendent 4 un message de 
leur propre langue. Cela ne va pas sans difficultés, mais le sujet suppose que le 
message a rencontré quelque part un bruit considérable, qui doit étre éliminé; 
on ne réémet que le message “corrigé” (mais qui est en fait complétement 
modifié). 

Il est évident que c’est la “répétition” qui est l’opération linguistique et 
l’opération la plus facile 4 réaliser pour les sujets normaux. L’“‘imitation’’ exige 
toujours un effort d’attention et elle n’est effectivement praticable que si le 
message imité est trés différent de tout message linguistique admissible dans 
la langue du sujet. Si on imite plusieurs fois de suite, on déforme a chaque fois, 
et on risque de se rapprocher d’un message linguistiquement admissible. Alors, 
malgré les efforts, on ne pourra plus continuer d’imiter, mais on répétera, 
donc le bruit cessera de s’ajouter. 

Un cas analogue & celui de phonémes étrangers est constitué par les airs de 
musique écrits dans des systémes étrangers. Par exemple, Alfred Einstein (que 
nous allons citer de mémoire) raconte qu’un explorateur désirait rapporter une 
mélodie africaine, mais ne savait comment la noter. Faute de mieux, il se |’“‘imi- 
tait”’ matin et soir jusqu’au moment ot un musicien professionnel put |’aider. 
A son retour, ]’explorateur stupéfia les musicologues par l’importance des “in- 
fluences”’ que les airs 4 ce moment & la mode en Europe exercaient sur la musique 
de tribus en apparence tout a fait isolées. L’“‘influence” parut moins étrange 
lorsqu’on apprit que l’air avait été noté par un music’en de cabaret. Mais en 
fait, pouvons-nous ajouter, ce dernier agent était inutile pour expliquer les 
déformations intervenues. L’explorateur lui-méme ne pouvait “‘imiter” indéfini- 
ment; il finit par “‘répéter” une mélodie qui était entre temps devenue euro- 
péenne. 

Tout ce qui précéde dépend essentiellement du fait que l’on conserve |’original 
et retransmet la copie. Si on fait l|’inverse, le message ne doit plus étre discret 
pour pouvoir étre stable. Tel est par exemple le cas des arts plastiques. 


ARBITRAIRE ET DISCRET SONT DEUX QUALITES CORRELATIVES 


C’est lA une remarque de Ferdinand de Saussure (1916) (qui disait différentiel, 
et non discret). Elle a été beaucoup discutée par les linguistes. Ce qui est arbi- 
traire pour Saussure, c’est la relation entre le “signifiant”, que nous appelons 
message, et le signifié correspondant, qui est ce que le signifiant représente, par 
exemple la direction 4 prendre par les bateaux. Nous voyons dés lors que la 
remarque de Saussure est correcte, dans le sens suivant. 

Si le signifiant et le signifié étaient continus, il faudrait qu’é des signifiés 
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voisins, c’est-A-dire partiellement différents, correspondent des signifiants voi- 
sins. La structure des voisinages des signifiants devrait étre la méme que celle 
des signifiés. La correspondance entre les deux ne serait donc pas arbitraire. En 
fait, la plupart de structures de voisinage de signifiés ne seraient pas réalisables 
avec des signifiants 4 support acoustique. 

Si, au contraire, signifiant et signifié sont discrets, il n’y a plus aucune question 
de voisinage; il y a identité ou différence. Le signifiant est un élément de nomen- 
clature, une sorte de nombre de référence, et on peut le changer & volonté, a 
condition que des signifiés différents aient des signifiants différents. 

En somme, comme il est nécessaire que les signifiants soient discrets, il est 
néicessaire que la correspondance entre signifiant et signifié puisse étre arbitraire. 
Mais ceci ne veut nullement dire qu’elle puisse &tre changée 4 volonté, une fois 
établie, ni que le choix de la correspondance doive étre fait au hasard; mais, au 
contraire, que |’on dispose initialement d’un choix, que |’on peut faire rationnel- 
lement, & condition que le critére de rationalité qu’on va choisir ne contredise 
pas le critére de rationalité qui précéde. Dans la deuxiéme étude, nous pro- 
poserons un tel critére, et montrerons qu’il semble pouvoir expliquer la statis- 
tique sous-jacente au choix de certains signifiants discrets: les mots. 


CARACTERE DISCRET DES SIGNES ET REPRESENTATION DES IDKES 


Les conditions que l’on vient d’imposer a la structure du message de tout texte 
sont sans aucun rapport avec le but réel de la communication linguistique et con- 
cernent seulement les moyens qui existent de ne pas s’égarer de tout but. Mais, en 
fait, les textes ainsi restreints permettent quand méme de communiquer; c’est 
donc que les moyens qui restent sont quand méme suffisants, pour certains buts 
tout au moins. La question de savoir s’il y a des idées non communicables 4 
cause du caractére discret des messages linguistiques n’est pas une question 
scientifiqu®, mais une question philosophique. Certains philosophes déplorent 
Vappauvrissement des idées par leur représentation linguistique. D’autres met- 
tent en question l’existence méme d’une pensée qui serait inexprimable pour 
des raisons techniques, et de toute pensée avant son expression linguistique. 
La théorie des communications ne permet de rien ajouter & ce sujet. 


RELATION ENTRE LES SIGNES DISCRETS ET LES SONS CONTINUS 


Pour résumer, si les signes linguistiques sont discrets, c’est parce qu’ils sont 
“‘nortés’”’ par des sons continus. Mais quelle est, dans le textes réels, la relation 
entre ces deux aspects un peu contradictoires? En particulier, existe-t-il des 
signes linguistiques ultimes inanalysables, mais correspondant chacun 4 des 
propriétés bien déterminées du texte-objet physique, et par suite ayant des pro- 
priétés intrinséques? La réponse doit venir de |’expérience, mais il faut mettre 
en garde contre les pseudo-unités auxquelles on peut aisément ramener tout 
systéme de signes discrets. Prenons encore une fois notre signal nautique comme 
exemple. Supposons que la manoeuvre se fasse 4 un degré prés, et que tous les 
commandements transmis comme angles entre, disons, 23,5° et 24,5° signifient 
qu’il faut prendre la direction faisant angle de 24° avec le Nord. Les signes du 
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message sont donc les nombres entiers entre 1 et 360. II est clair qu’un message 
tel que: 234 admet la décomposition: 2, suivi de 3, suivi de 4. Mais les “‘signes”’ 
2,3,4 n’ont absolument aucune existence indépendante: il n’y a aucun caractére 
physique commun aux objets physiques portant les messages commengant par 2. 

L’utilité de la théorie des communications en ce qui concerne la théorie des 
phonémes dépasse d’ailleurs de telles mises en garde. Cf. Jakobson, Halle et 
Fant (i952) et Cherry, Halle et Jakobson (1953). 


SYNCHRONIE DU POINT DE VUE DU CARACTERE DISCRET 


Nous avons remarqué que les messages discrets ne sauraient admettre qu’une 
“diachronie externe” qui ne changerait pas la structure. Une “diachronie in- 
terne’’, structurale, suppose que le caractére discret n’est pas établi de fagon 
parfaite. Par exemple, soient deux phonémes tendant 4 se confondre. Entre le 
moment ou ils sont différents et celui od ils sont identiques, il doit y avoir un 
temps ow ils sont partiellement les mémes, c’est-A-dire od, ayant recu l’un, on 
se trompe de plus en plus souvent et on émet |’autre. Le fait que tous les signes 
restent discrets peut donc étre adopté comme premiére définition, assez peu 
astreignante, de la période de synchronie. Par ailleurs, toute loi diachronique est 
nécessairement en contradiction avec la régle de stabilité qui motive le caractére 
discret. Ceci est d’ailleurs tout 4 fait normal, puisque les effets diachroniques 
réels ne s’introduisent qu’au bout d’un nombre extrémement grand de répéti- 
tions, quand la stabilité par ‘“‘répétition” ne peut plus étre maintenue. 


REMARQUE NEURO-PHYSIOLOGIQUE 


On ne peut manquer de rapprocher le caractére discret des signes linguistiques 
du fait que les cellules nerveuses (qu’on appelle les neurones) fonctionnent par 
“tout ou rien’’, c’est-A-dire sont susceptibles d’étre excitées, ou d’étre au repos, 
sans degrés intermédiaires. Par suite, tout message, discret comme les signes 
linguistiques, ou continu comme les images visuelles, doit nécessairement finir 
par étre représenté de facon discréte. Ce fait aurait bien pu avoir une influence 
sur la structure des seuls signes dont on peut contrdéler la structure: les signes 
linguistiques. Dans ce cas, un chapitre de neuro-physiologie aurait fait partie 
de la linguistique. Cependant, rien n’est connu dans ce domaine, et comme on 
peut s’en passer en opérant comme nous |’avons fait, il vaut mieux éviter la 
neurophysiologie. 

Nous recommandons au lecteur de se reporter 4 ce point a |’introduction 
de l'article. 


III. SUR LES MOTS COMME UNITES CONCRETES DE TOUT TEXTE 
(GENERALISATION ET EXPLICATION DE LA LOI STATISTIQUE D’ESTOUP ET DE ZIPF) 


Comme base de cette deuxiéme partie, on suppose acquis que le message de 
tout texte est une suite linéaire de segments discrets. Les propriétés statistiques 
de ces segments sont faciles 4 étudier, et on peut s’y prendre de plusieurs fagons. 
Nous allons nous occuper de certaines de ces propriétés statistiques, qui présentent 
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le double caractére d’étre assez fines et d’étre vraies pour tout texte, quelle que 
soit la langue dans laquelle il est écrit, et quel que soit son auteur. 

Ces propriétés ne se présentent pas comme des principes catégoriques du type 
de ceux de la premiére étude, mais comme des tableaux de données numériques 
ou comme des courbes expérimentales. I] est impossible d’‘expliquer”’ directe- 
ment de tels tableaux ou courbes. II faut d’abord les remplacer par des formules 
mathématiques, qui sont en quelque sorte équivalentes 4 des principes catégo- 
riques, puis chercher des propriétés de ces formules, qui puissent étre considérées 
comme “nécessaires’’, et par suite puissent ‘“expliquer’’ les données de départ. 
La méthode est donc différente de celle de la premiére étude. D’ailleurs, la 
représentation des données par des formules introduit une certaine indétermina- 
tion dans toute théorie de ce genre. En effet, la formule n’est jamais uniquement 
déterminée: si l’on tient compte du fait que tout nombre exprimant une mesure 
est entaché d’erreur expérimentale, on voit qu’on a le choix entre des formules 
simples, mais seulement c.pproximatives, et d’autres formules, plus exactes, mais 
d’autant plus compliquées. Suivant la formule choisie, les conséquences logiques 
pourront beaucoup différer, ce qui est trés dangereux, étant donné le caractére 
arbitraire du choix de la formule. De plus, nous étudions plusieurs phénoménes 
associés; si on les représente par des formules indépendamment les uns des 
autres, il y a de fortes chances pour que les conséquences que |’on tirera de ces 
formules soient incompatibles les unes avec les autres. 

Il faudra donc mener de pair la représentation des données numériques par des 
formules, et explication des données @ travers les formules. De plus, on ne consi- 
dérera une explication comme naturelle, et non “ad-hoc’’, que si les critéres 
choisis expliquent également d’autres phénoménes, d’abord inconnus. 

C’est effectivement de cette fagon qu’avait progressé la théorie dont nous al- 
lons présenter les lignes générales, en renvoyant pour les détails mathématiques 
& nos articles cités dans la bibliographie.* Notre point de départ était un compte- 
rendu critique du livre de G. K. Zipf (1949). Ce travail disait qu’Estoup (1916) 
et Zipf avaient établi que les données empiriques relatives aux fréquences des 
divers mots dans un texte peuvent étre représentées par la loi suivante dite 
“‘harmonique’’: Numérotons les mots @ l’aide de leurs rangs dans le classement par 
fréquences décroissantes; la loi harmonique est satisfaite si la fréquence de tout mot 
est inversement proportionnelle a son rang. Si on appelle le rang “‘r’’, et la fréquence 
correspondante ‘‘p,”’, on a: p, = P/r, ov P est une constante pour tout texte. 

D’aprés Estoup, Zipf, et al., la loi s’appliquerait aux mots-formes (deux 
formes grammaticalement différentes d’une méme unité du dictionnaire sont 
considérées comme des mots différents). Ceci implique que les fréquences des 
mots-formes ont quelque chose d’intrinséque, ce qui parait tout 4 fait étrange a 
priori. Il semble en effet que le choix d’un mot soit toujours dicté par le besoin 
que |’on a de l’utiliser, et non par quelque désir de satisfaire & une loi, dont 
l’énoncé lui-méme exige déja la considération de textes de longueur assez grande. 


3 Signalons également qu’un bref résumé de ces détails mathématiques vient d’étre 
donné par G. A. Miller (1954). 
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De plus, méme si une loi devait exister, il semble qu’elle devrait différer de texte 
& texte (tout en restant décroissante, par définition du rang). 

Cependant, qu’on le veuille ou non, la loi semblait établie, et l’impression de 
liberté que ]’on croit avoir dans le choix des mots devait donc étre erronnée, ainsi 
que l’impression qu’a chaque idée correspond un mot, puisqu’il est clairement 
absurde de supposer que les fréquences des idées suivent une loi quelconque. 
Zipf, suivant l’auteur du compte-rendu, aurait dés lors postulé que la loi harmo- 
nique constitue une propriété du systéme des signes, indépendante des idées, 
et aurait suggéré que cette propriété peut étre expliquée par un “principe de 
moindre effort’’, indépendant du sens des mots. Mais, en fait, son principe 
n’était nulle part explicitement énoncé, et il était clair que |’“‘explication” de 
Zipf était verbale, ou, au mieux, une suggestion pour la recherche d’une explica- 
tion scientifique. 

Pour trouver une vraie théorie, nous avons songé & la théorie de la communi- 
cation de l’information de Claude Shannon (1948) et de Norbert Wiener (1948). 
Cette théorie s’occupant elle-méme des formes vides, elle serait susceptible de 
donner un sens précis & la quantité 4 minimiser, et de définir des quantités qui 
doivent rester constantes dans cette minimisation (puisque, quelle que soit la 
définition de |’“‘effort’’, celui-ci est nul si on ne fait rien). 

Le nombre moyen de lettres ou de syllabes par mot n’est pas une bonne définition 
de l’effort, c’est-A-dire ne méne nulle part. Nous avons alors pensé qu’un “effort’’ 
susceptible d’influencer la structure des textes de facon précise devrait &tre re- 
latif 4 quelque étape plus critique du circuit de transmission qui va de la bouche 
du premier interlocuteur & l’orzille du deuxiéme, puis & son cerveau, puis de 
nouveau & la bouche, etc. ... Hn fait, une suite de lettres ou de phonémes ne 
sont que deux facons particuliéres de décrire le message que contient le texte- 
objet physique “‘public’’. Dans le cerveau récepteur, le texte est un autre objet 
physique, “privé”. Cet objet doit transmettre le méme message, donc comporter 
certains éléments discrets en correspondance un-a-un avec certains éléments 
discrets du message du texte public. Mais tl n’est pas nécessaire qu’il y ait cor- 
respondance entre TOUS les éléments discrets. En d’autres termes, le cerveau aurait 
fort bien pu “recoder’’ mot-par-mot, mais pas lettre-par-lettre. Les nouvelles 
“lettres idéales’’ ne seraient plus conditionnées par les mémes éléments arbi- 
traires, par exemple historiques, qui conditionnent les phonémes ou les lettres 
ordinaires; i] suffirait que deux mots différents aient des orthographes différentes. 
Nous avons postulé qu’a part cela, le récepteur va s’‘adapter” a la structure 
statistique du message recu, c’est-a-dire utilisera la suite de “lettres idéales’’ la 
plus courte, donc la moins cofiteuse pour le mot le plus fréquent, et ensuite des 
suites de lettres de cofits croissants pour coder les mots classés par fréquence 
décroissantes. 

Quelle doit étre d’autre part la quantité qu’on supposera constante dans la mini- 
misation du cott? Intuitivement, ce doit étre une sorte de “‘variété” dans |’uti- 
lisation du vocabulaire. Ce concept peut étre rendu rigoureux si on |’identifie & 
ce que Shannon (1948) appelle “information”, et qui est une mesure, non pas 
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de ce que chaque mot apprend en moyenne, mais de |]’élément de surprise, 
fructueuse ou non, qu’apporte en moyenne la réception de chaque nouveau mot. 

Il est assez facile de trouver mathématiquement pour quelle statistique des 
mots le cofit moyen de recodage par mot est minimum, lorsque la variété- 
information a été fixée. Cette statistique optimale se traduit par une loi rang- 
fréquence, que nous appelerons “‘canonique’’, parce qu’elle va servir de “canon de 
référence” dans ce qui suit. La loi canonique dépend du nombre de mots dont 
on dispose et de la variété avec laquelle on veut les utiliser. Sa forme est: p, = 
P(r + p)-®, ot p et B sont des constantes, comme P. Cette loi est expliquée 
en détail plus loin. On peut cependant déja remarquer que si B = 1 et p = 0, 
on retrouve la lot harmonique, qui est un cas particulier de la loi canonique. Au- 
trement dit, pour démontrer que les mots ‘“‘doivent”’ suivre la loi harmonique, 
nous devions encore démontrer que B et p doivent prendre toujours certaines 
valeurs particuliéres, ce qui, dans notre théorie, signifie que tout texte doit com- 
porter le méme nombre de mots et la méme variété dans |’emploi de ces mots. 

I] paraissait impossible de démontrer ce dernier résultat, et par conséquent 
de démontrer la loi d’Estoup et de Zipf par notre méthode. Cependant, quand 
nous avons réussi 4 mettre la main sur le livre de Zipf, et 4 examiner les données 
que nous ne connaissions encore que par le compte-rendu, nous avons constaté 
de toute évidence que, bien que la formule de Zipf donne |’“‘allure générale” des 
courbes, elle en représente trés mal les détails. De plus, ainsi que nous venons 
de le montrer, elle ne peut pas étre expliquée. Au contraire, notre formule cano- 
nique représente de fagon excellente les statistiques de tous les textes étudiés 
par Zipf, 4 condition de choisir, séparément pour chaque texte et de fagon con- 
venable, les nombres B et p, ou, ce qui est équivalent, le nombre de mots du 
vocabulaire et la variété de leur utilisation. Donc, comme il se doit, on peut 
tenir compte des différences entre les textes, du point de vue du nombre de mots 
du vocabulaire, et de la variété d’emploi de ce vocabulaire; et malgré cela, on 
peut dégager une propriété de “‘moindre quelque chose’’, commune 4a tous les 
textes. 

Finalement, pour revenir au probléme méthodologique du choix de la formule 
représentant les données expérimentales, la formule de Zipf a servi de point de 
départ 4 notre théorie; mais celle-ci s’est ensuite développée indépendamment, 
et on n’a dé introduire aucune hypothése ad-hoc pour revenir effectivement au 
point de départ. L’autre condition méthodologique est que la théorie doit per- 
mettre d’expliquer d’autres propriétés des textes que celle dont elle était partie. 
Cette condition sera également satisfaite. 

Nous allons maintenant reprendre notre théorie de la statistique des mots, de 
facon systématique et détaillée. De plus, au lieu de partir des résultats connus 
pour “deviner” le probléme auquel répond la structure de la langue, comme 
nous venons de le faire, nous allons dégager pas & pas le probléme a partir de 
considérations trés générales, puis le résoudre. Le lecteur qui ne s’intéresse pas 
& ces détails peut directement sauter 4 la p. 20, od reprend |’étude de la loi 
canonique. 
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CONSIDERATIONS SUR LA RECEPTION D’UN SIGNAL PORTANT UN MESSAGE DISCRET 
ET SEGMENTE 


Le message est d’abord porté par un “texte public”, constitué par une onde 
acoustique; puis il est porté par des impulsions nerveuses “‘privées’’, qui sont les 
résultats d’opérations que l’appareil récepteur effectue sur le texte public. Nous 
savons de la premiére étude qu’il n’est nullement nécessaire que tous les segments 
discrets que |’on distingue dans le message aient des supports physiques bien 
déterminés. Rien n’implique non plus que fous les segments discrets ayant un 
support dans le texte public en aient également un dans le texte nerveux privé. 
Cependant, pour que la correction des erreurs, potentiellement permise par le 
caractére discret des signes, soit effectivement possible, il faut qu’il y ait au moins 
une classe de signes discrets, commune & tous les codages. La réception et le 
décodage d’un texte doivent done comporter le découpage du texte en tranches. 
Chaque tranche doit étre identifiée 4 tour de réle, et les résultats des opérations 
d’identification constituent le nouveau code. Donc, décodage est synonyme de 
recodage. Les tranches spéciales utilisées dans le décodage ont deux représenta- 
tions discrétes, une publique et une privée, dont chacune est sans doute cons- 
tituée d’unités discrétes plus élémentaires. Mais, d’aprés ce qui précéde, il ne 
doit nullement y avoir correspondance entre toutes ces unités. Ceci est analogue 
& ce qui se passe avec les deux représentations des mots constituées par leur 
prononciation et leur orthographe. Il y a toujours correspondance un 4 un entre 
les mots, mais, en anglais par exemple, il n’y a presque aucune correspondance 
entre phonémes et lettres.‘ 

Dés lors, si le systéme de décodage est ‘‘adapté” 4 la structure de la langue 
décodée, le décodage doit utiliser les coupures structurales du texte décodé. 
Réciproquement, les coupures structurales du texte sont celles que |’on choisit 
systématiquement dans le décodage. La réception nous fait ainsi retrouver un 
concept d’‘‘unité concréte’’ dela langue, qui parait analogue a un concept assez vague 
de Saussure. 

Comment peut-on cependant définir les coupures structurales indépendam- 
ment du décodage? Considérons des segments discrets superposés, phonémes, 
morphémes, mots, propositions, textes. Supposons que |’on veuille étre absolu- 
ment correct, c’est-4-dire absolument a-chronique dans I’utilisation de la langue. 
Il faut pour cela ne jamais utiliser que des tournures déja utilisées auparavant. 


‘Comment se présente le probléme de la dimension du temps, du point de vue du déco- 
dage? Le temps est simplement un ordre de consécutivité des tranches. Lorsqu’on ne con- 
sidére qu’une espece de tranches, par exemple les mots, la linéarité du temps est une con- 
séquence de ce que le temps physique a une seule dimension. Mais supposons qu’il y ait 
deux genres de tranches, par exemple les unités concrétes et les opérations élémentaires de 
décodage. Pendant que 1’on recode une unité concréte, le temps de consécutivité des 
unités concrétes est arrété, et un autre temps joue, le temps de consécutivité des opéra- 
tions élémentaires. Done, il y a en fait deux dimensions, qui différent par ce que |’une 
d’entre elles ne se développe pas jusqu’a |’infini. En somme, il n’y a pas de différence 
de dimensionalité entre la langue ordinaire et, par exemple, le systtme de drapeaux men- 
tionné par de Saussure. 
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C’est d’ailleurs effectivement ce qu’on essaye de faire lorsqu’on enseigne une 
langue par la “méthode vivante” (enseigner des phrases et non des segments), 
ou par la méthode des classiques (on n’utilise que les tournures d’un trés petit 
nombre d’auteurs). Si au contraire on acceptait n’importe quel effet diachronique, 
qui ne détruirait pas le caractére discret des signes les plus élémentaires, il 
faudrait admettre comme texte n’importe quelle combinaison de phonémes. 
Dans la réalité, dans un état de la langue que |’on considérerait intuitivement 
comme synchronique, on se place entre ces deux extrémes: les possibilités d’appa- 
rition des formes les plus courtes sont nulles, celles des formes plus longues, 
comme les mots, sont faibles, mais celles des formes au-dessus des mots sont 
extrémement grandes, pratiquement infinies. Dés lors, |’hypothése de Saussure, 
et la nétre, que tout texte est constitué d’unités concrétes, équivaut 4 identifier 
la langue par approximation 4 un systéme ow les possibilités d’apparition de 
nouvelles formes sautent brusquement de zéro & |’infini, lorsque la longueur des 
formes dépasse la longueur des unités concrétes. 

Nous supposons de plus que les unités concrétes sont en étroite liaison avec 
la méthode de réception. Nous devons donc chercher quelies propriétés véri- 
fiables ces unités devraient avoir, pour étre bien choisies du point de vue de la 
réception. Quand nous aurons montré que ces propriétés, de nature statistique, 
sont effectivement vérifiées dans le cas des unités linguistiques: les mots, nous 
aurons, partiellement au moins, justifié a posteriori notre choix de la méthode 
pragmatique pour déterminer les unités concrétes, de préférence 4 la méthode 
purement linguistique. 

Le récepteur-décodeur est une machine, en fait une sorte de machine 4a cal- 
culer. Pour les besoins de la théorie, nous nous imaginons étre dans un état od 
il est possible de construire la machine en partant de zéro, sans tenir compte 
des machines précédemment utilisées. I] faut quand méme, en établissant les 
plans, tenir compte de deux classes de considérations économiques: d’une part, 
les considérations d’investissement initial, d’autre part, les considérations de 
rendement en cours de marche. 

L’investissement initial augmente avec la vitesse de fonctionnement qu’on 
désire atteindre et dont nous faisons abstraction, et avec la mémoire de la ma- 
chine, c’est-d-dire la longueur de texte qu’elle peut stocker avant de recoder. 
Par exemple, pour recoder phonéme par phonéme, il ne faudrait stocker que des 
petits morceaux de texte; pour recoder phrase par phrase, il faudrait stocker de 
trés longs morceaux. Une machine dont les unités concrétes seraient les phonémes 
aurait besoin de moins de mémoire qu’une machine dont les unités concrétes 
seraiex:t les phrases, et serait moins chére 4 construire. Nous prendrons, comme 
mesure du coiit initial de construction, le délai moyen dont la machine est capa- 
ble dans le recodage unité par unité. (Pour montrer |’importance des possibilités 
de stockage en ce qui concerne le changement de code, il est bon de considérer 
les enfants qui apprennent a écrire sous la dictée. Au début, quand la machine 
est encore peu développée, il faut dicter syllabe par syllabe; puis, on dicte mot 
par mot; enfin petite phrase par petite phrase.) 

Quant au rendement en cours de marche, on peut le définir de bien de fagons 
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différentes. Nous prendrons comme définition la “‘variété” dans l’usage des mots, 
divisée par le nombre d’opérations utilisées en moyenne pour décoder un mot. 
On montrera plus loin qu’il existe une expression trés intrinséque de la variété, 
qu’on appelle |’‘information’”. Mais quelle que soit la variété, le rendement 
augmente avec le coft d’investissement. 

Ce qu’on aurait voulu avoir, c’est un rendement aussi grand que possible avec 
un investissement initial aussi petit que possible. Mais on vient de voir que ces 
conditions sont parfaitement incompatibles. I] faut donc établir un compromis 
entre elles: on peut par exemple fixer ]’investissement, c’est-A-dire le délai, et 
augmenter le rendement au maximum, ce qui se réduit 4 rendre la variété 
maxima. On peut aussi fixer la variété, et minimiser le délai. On peut montrer 
que ces problémes admettent tous deux la méme solution: chaque unité doit 
avoi. une fréquence d’émission bien déterminée (on dit alors que les unités sont 
“ergodiques’”’) et ces fréquences doivent suivre une loi bien déterminée, que 
nous appelerons “canonique’’. 

Le probléme du compromis entre délai et variété doit &tre lui-méme résolu 
en deux étapes. D’abord, on suppose que la structure statistique du message 
recu est donnée, et on cherche & construire le décodeur de cot moyen—mesuré 
par le délai—donné. Nous dirons par définition qu’on adapte le décodeur 4 la 
statistique du message. Cette adaptation est de nature psycho-physiologique indi- 
viduelle. Dans la deuxiéme étape, on suppose que le délai moyen en décodage 
optimal est fixé, et on cherche quelle structure du message initial apporte le 
maximum d’“‘information”’; ou bien, ce qui méne au méme résultat, on suppose 
que l'information que |’on veut apporter est fixée, et on cherche a |’apporter 
avec un délai moyen en décodage minimum. Nous dirons par définition gu’on 
adapte le message au décodeur. Pour interpréter cette deuxiéme étape, il faut 
supposer que |’émetteur connait effectivement le fonctionnement du décodeur 
de son partenaire, ce qui est impossible; ou qu’on attribue & l’interlocuteur ses 
propres réactions, c’est-a-dire que, par une identification intersubjective, ‘‘on 
pense & |’interlocuteur comme pensant’’, et désirant recevoir précisément les 
messages que l’on désire soi-méme recevoir. 


ADAPTATION DU DECODEUR A LA STRUCTURE DU MESSAGE, CONSIDERE COMME 
SUITE D’UNITES CONCRETES 


Nous supposons d’abord, ce qui est absolument essentiel pour la suite, que le 
nombre des opérations élémentaires de décodage qui sont disponibles est beau- 
coup plus petit que le nombre d’unités, c’est-a-dire qu’aprés recodage, chaque 
unité est encore une suite de segments plus courts, dont chacun correspond au 
résultat d’une opération de décodage. Par ailleurs, on suppose que les opérations 
élémentaires sont telles que tout2 opération peut suivre toute autre (il n’y a pas 
de combinaisons interdites) et que toute suite d’opérations recodant une unité 
est suivie d’un symbole “intervalle’’. 

Coimment distribuer alors les opérations élémentaires et leurs combinaisons 
parmi les diverses unités concrétes? Le seul moyen d’agir rationnellement est de 
supposer que les messages qu’on recevra a |’avenir auront la méme structure que 
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les messages que |’on avait recus dans le passé. Par ailleurs, le seul élément du 
passé qu’il faut extrapoler est constitué par les fréquences des unités concrétes. 
On va toujours attribuer 4 chaque mot une probabilité a priori égale & sa fré- 
quence et faire correspondre la méthode de recodage la plus courte a l’unité la 
plus fréquente, et les méthodes de plus en plus longues, c’est-a-dire cofiteuses, 
aux unités classées par ordre de fréquences décroissantes. 

Supposons que les unités soient déja classées par ordre de fréquences dé- 
croissantes. Cherchons le coft du décodage correspondant 4 l’unité de rang r 
dans ce classement. I] semble 4 priori que ce cofit devra dépendre des cofits 
(durées) des opérations élémentaires de décodage dont on dispose; comme ces 
derniers cofits ne peuvent étre effectivement mesurés, la théorie reposerait sur 
une base arbitraire, dont on pourrait tirer n’importe quoi. Mais, en fait, 1’in- 
fluence des cofits des opérations élémentaires se révéle étre limitée. Supposons 
par exemple que |’on ait dix opérations élémentaires, et classons ces opérations 
et leurs combinaisons par ordre des cofits croissants des combinaisons. Pour 
connaitre les quelques vingt combinaisons les moins chéres, les cofits des opéra- 
tions élémentaires sont d’abord essentiels, puis seulement importants. Mais au 
dela de vingt, on voit apparaitre une sorte de phénoméne de compensation, et la 
variation du codt en fonction du rang par codt croissant finit par prendre une 
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forme universelle, traduite, par définition, par la “fonction logarithme’’. (Re- 
marquons d’avance que la zone logarithmique va expliquer le caractére linéaire 
des courbes de Zipf pour des rangs assez élevés, tandis que la zone initiale ov les 
cofits individuels sont importants va expliquer l’applatissement des courbes de 
Zipf pour les petits rangs.) Pour la forme de la fonction logarithme, voir la figure 
de la page précédente. 

Donec, en cherchant & rendre le cofit moyen de décodage aussi petit que pos- 
sible, on attribue 4 chaque unité un cofit indépendant de sa fréquence exacte, 
mais proportionnel au logarithme de son rang par fréquences décroissantes. 

Cependant, on n’a d’intérét 4 effectuer une adaptation du décodage au mes- 
sage, que si le processus de décodage ne doit pas changer constamment. Donc, 
st on reconstruisait le message librement, on s’arrangerait pour que la fréquence de 
chaque unité varie peu, de facon que l’ordre par fréquences décroissantes soit, lui, 
invariable. Cette condition de fréquences presque constantes ne peut pas bien 
entendu étre réalisée au cours de chaque courte période, d’abord 4 cause des 
fluctuations dues au hasard, puis 4 cause des contraintes sémantiques. La con- 
stance des fréquences ne peut pas non plus étre réalisée au cours de trés longues 
périodes, mais nous allons considérer ces variations & longue période comme 
faisant partie des effets diachroniques. Tout “‘texte’”’, suivant notre définition 
de la p. 1, doit tomber tout entier dans une période de synchronie, ov |’ordre des 
unités concrétes dans le classement par fréquences décroissantes reste inchangé. 
Cette nouvelle définition de la synchronie est plus astreignante que la définition 
liée au caractére discret des formes linguistiques, qui avait été donnée dans la 
premiére étude, p. 9. 


ADAPTATION RECIPROQUE DE LA STRUCTURE DU MESSAGE AU DECODEUR QUI 
VIENT D’£TRE DETERMINE 


Lorsque l’adaptation dont nous venons d’achever |’étude est réalisée, le 
récepteur a fait tout ce qu’il peut pour diminuer le cofit de décodage. Mais ce 
coiit de décodage ne dépend pas seulement de la maniére dont les signes ont été 
manipulés, mais aussi de la statistique exacte de ces signes. En d’autres termes, 
si l’émetteur se “‘coalise’’ avec le récepteur en n’envoyant que certains messages 
de structure particuliére, le coit de décodage peut étre rendu encore plus petit, 
aussi petit que le permet la variété que |’on désire dans |’emploi des unités. 
Ou encore, ce qui est plus intuitif, supposons que le coft soit limité (tout comme 
pour un enfant il y a une limite au nombre de lettres qu’il peut se rappeler 4 
la fois sous la dictée) et essayons de ne jamais transmettre que des messages dont 
la variété est aussi grande que le permet la limitation du coft. 

“Variété” est pour le moment un concept indéfini. Afin de le préciser par un 
concept bien défini il faut d’abord énumérer les propriétés que ce concept devrait 
avoir. Si toutes les unités avaient des fréquences égales, un texte serait, bien 
sir, considéré comme étant trés varié si le nombre d’unités est trés grand, et 
peu varié si ce nombre est petit. Donc la variété d’un texte a unités de fréquences 
égales doit étre une fonction croissante du nombre de ces unités. Supposons 
maintenant que les unités aient des fréquences inégaies: par exemple, soient 
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1009 unités, dont 9 sortent chacune 1 fois sur dix, et les 1000 autres, 1 fois sur 
10.000. Ce texte serait considéré comme beaucoup moins varié qu’un texte & 1009 
unités équifréquentes, et comme un peu plus varié qu’un texte a 9 unités équi- 
fréquentes. Donc, la variété doit de toute fagon dépendre des fréquences relatives 
des unités. Si, de plus, on se limite aux problémes de la manipulation des formes 
vides, les fréquences sont les seules choses dont la variété puisse dépendre. 
Shannon a trouvé une expression particuliérement intrinséque de cette variété, 
qu’il a appelée “information”. Pour éviter les innombrables connotations quali- 
tatives de ce deraier terme, comme d’ailleurs du terme “variété’”’, nous parlerons 
toujours de “‘variétéinformation”’, qui devra étre toujours entendue dans le sens 
technique précis introduit par Shannon. (Signalons la forme exacte de I’informa- 
tion: si les unités W, ont respectivement les probabilités p,, l’information est 
définie par H = —2 p, log p,. En d’autres termes, on fait un tableau des p,, 
puis 4 cété un tableau des logarithmes des p,, enfin un tableau des produits des 
termes des deux premiers tableaux. Alors, on fait la somme de tous les termes du 
troisiéme tableau, et ce que l’on obtient est par définition la variété-information.) 

Dés lors, posons le principe suivant: un ‘“‘bon texte” “doit”? comporter le 
maximum de variété-information par unité, compatible avec une limitation de 
la durée de décodage moyenne par unité (ou, de facgon équivalente, doit comporter 
la plus petite durée de décodage—considérée comme coitit ou “effort”’—compatible 
avec la variété-information que l’on désire). De ce principe, on peut déduire 
mathématiquement que les unités doivent suivre une distribution statistique 
trés particuliére, ot la fréquence, ou plutét probabilité a priori, du mot de rang 
r pa: fréquences décroissantes est donnée par la formule suivante, dite 
canonique: p, = P(r + p)~”; ou encore: 


log p, = log P — B log (r + p) 


Cette formule ne doit pas effrayer les linguistes: elle signifie simplement ceci: 
Prenons le tableau des fréquences et des rangs obtenu par décompte d’un texte; 
prenons une feuille de papier dit bi-logarithmique, dont les graduations ne sont 
pas des petits carrés, tous égaux, mais diminuent lorsqu’on va de gauche 4 droite, 
ou de bas en haut; reportons les résultats du décompte sur ce papier log- 
arithmique, en nous servant des graduations qu’il comporte. Alors, la courbe qui 
joint ces points doit étre une ligne droite de pente négative égale 4 B, sauf pour 
les unités les plus fréquentes, dont les fréquences doivent tomber en dessous de la 
droite de pente —B. Voir la figure de la page suivante. 

En d’autres termes, pour tout i... , B et p sont des constantes, tandis que 
la rang r prend une valeur différente pour chaque unité, mais toujours inférieure 
au nombre total d’unités disponibles, que nous désignerons par R. 

Pour mieux expliquer le réle des nombres R, B et p, appelés “paramétres” du 
texte, on peut les comparer aux tailles d’un véteraent. Pour un vétement donné, 
les tailles sont fixes. De méme, pour un texte donné, les paramétres sont fixes. 
Mais les vétements que peut coudre un tailleur n’ont pas nécessairement tous 
la méme taille; le tailleur prend les mesures du client, c’est-A-dire quelques nom- 
bres donnant une idée trés schématique de la forme du client, et coud, c’est-a- 
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lois canonique et harmonique 


dire construit, un vétement a ces tailles. De méme, les textes satisfaisant a la 
loi canonique n’ont pas tous les mémes paramétres; le constructeur de bonne 
langue prend les mesures du client (dans ce cas le délai qui lui est permis dans le 
codage unité par unité), qui donnent encore une idée trés schématique de ce 
client, et construit une structure statistique 4 ces mesures. Dans les deux cas, 
toute la structure est déterminée par quelques nombres; mais, dans le cas du véte- 
ment, cette détermination fait intervenir un “canon” tout a fait arbitraire 
d’élégance masculine, tandis que dans le cas de la langue, le canon est tout @ fait 
intrinséque: c’est économie de la réception. Quant au nombre P, ce n’est pas un 
paramétre indépendant, car il faut que la somme de toutes les probabilités soit 
égale 4 1, ce qui, lorsque R, B et p sont fixés, n’est possible que pour une seule 
valeur déterminée de P. En somme, P est analogue & la largeur des manches d’un 
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vétement, qui est déterminée par les canons d’élégance, une fois que les tailles 
ont été déterminées. 

Commentons dés lors le réle de la loi empirique d’Estoup et de Zipf. Cette loi 
correspond & B = 1 et & p = 0, donc a des tailles particuliéres. Lorsque Zipf 
essayait de représenter tout par cette loi, il essayait d’habiller tout le monde avec 
des vétements d’une seule taille. 


IDENTIFICATION DES UNITES CONCRETES 


Les unités sur lesquelles nous venons de discuter étaient tout a fait indéter- 
minées et hypothétiques. Nous disions: s7 telles et telles circonstances sont satis- 
faites, alors tout texte doit étre construit au moyen d’unités concrétes, ayant des 
fréquences bien déterminées, et suivant la loi canonique. Arrivons maintenant 
au point essentiel: chercher si les textes réels comportent des unités concrétes 
suivant notre définition. Si de telles unités existent, nous avons remarqué, 
p. 13, qu’elles devraient jouer un réle structural, donc probablement avoir une 
définition intuitive, et un mot pour les désigner dans la langue courante. Réci- 
proquement, pour identifier les unités concrétes, on n’aura pas besoin d’examiner 
toutes les unités possibles des textes, mais seulement celles qui ont une définition 
intuitive, et tout d’abord les unités les plus courtes parmi celles que |’on peut 
combiner presque librement, c’est-A-dire les mots. Comme on considére Jes mots 
en tant que “briques”’ de tout texte, il faut faire simultanément la statistique des 
verbes, substantifs, etc . . . mélangés; il ne faut pas identifier les synonymes, et 
surtout, il faut considérer les diverses formes fléchies d’un mot du dictionnaire 
comme des mots différents. (Ceci implique que dans le codage optimal que nous 
avons considéré, deux unités trés voisines grammaticalement pourront avoir des 
“‘orthographes’’ trés différentes, mais il n’y a rien d’absurde & cela, car, en fait, 
pour les verbes les plus fréquents, méme les codages publics sont trés différents, 
et personne n’éprouve de difficulté 4 reconnaitre que ce sont 14 des formes d’une 
méme unité du dictionnaire.) 

Les statistiques faites par Estoup (1916), Zipf (1935, 1949), Josselson (1953) 
et al., suivaient précisément les principes ci-dessus. Dés lors, par simple examen 
des données de ces auteurs, on voit que la loi canonique, au moyen de paramétres 
appropriés, permet de représenter mathématiquement les statistiques de mots. Nous 
renvoyons en particulier 4 Zipf (1949). Nous en concluons que les mots possédent 
la propriété centrale qui pour nous définit les unités concrétes: les textes sont “bien 
construits’’ comme suites de mots. Toutefois, les courbes rang-fréquence de Zipf 
ne sont pas trés satisfaisantes pour les mots les plus rares. Pour vérifier la canoni- 
cité des mots plus exactement, on peut examiner la variation du nombre de mots 
différents avec le nombre de mots total d’un texte. Des données nombreuses & 
ce sujet sont dues & Chotlos (1944) et & Baker (1950, 1951). Elles confirment 
encore les prévisions de la théorie. Le caractére concret des mots comme unités de 
tout texte est donc encore confirmé. Remarquons toutefois que rien n’implique 
qu’il ne puisse pas y avoir d’autres unités concrétes que les mots. En particulier, 
les mots et les morphémes peuvent étre simultanément concrets, et ceci semble 
étre effectivement le cas dans certaines langues indiennes d’Amérique. 
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CARACTERE CONCRET DES MOTS ET REPRESENTATION DES IDKES 


Nous venons en somme de montrer que la structure des textes ne saurait étre 
améliorée du point de vue de la statistique. A premiére vue, il peut paraitre étrange 
que ce soit le point de vue auquel |’on songe le moins qui soit le mieux servi de 
tous. Mais ceci n’est pas bien raisonné, car si on ne songe jamais a la statistique, 
c’est peut-étre bien, aprés tout, parce que |’on “‘sait’’ inconsciemment que tout 
est au mieux de ce cété-ci. Au contraire, si l’on songe tellement & la fidélité de la 
représentation que la langue donne des idées, et 4 la logique des raisonnements, 
c’est peut-étre bien parce que les choses ne vont jamais trop bien de ces cétés-la, 
et non pas parce que ces aspects sont les plus “importants”. Clairement, il 
s’agit ici de problémes philosophiques difficiles, que les boutades ne peuvent pas 
résoudre. 

La difficulté est en somme due au fait que l’on doit étudier des textes naturels, 
incorporant toutes sortes d’influences inconscientes. Par suite, on ne peut pas 
faire varier & volonté |’influence du désir d’étre logique, et de bien décrire ce 
qu’on veut décrire, car il faudrait un effort conscient pour obtenir des textes 
différant trés sensiblement les uns des autres. 

Fort opportunément, dans les cas de ce genre, on dispose toujours de la mé- 
thode pathologique. I] existe des malades mentaux qui possédent une mé- 
canique émettrice de mots pratiquement intacte, mais chez qui la correspondance 
entre les mots et les idées est gravement atteinte. Alors, on peut dire que les 
textes émis décrivent le malade, plutét que ce que le malade veut exprimer. 
Dans ces textes, la cohérence sémantique et syntactique peut étre trés petite, 
et la loi canonique devrait étre satisfaite par des textes trés courts (tandis que 
dans les textes de sujets normaux, il faut prendre des précautions trés grandes 
pour que les échantillons étudiés ne soient pas déformés par suite du fait que 
tout texte est consacré 4 un sujet particulier). Il semble que le langage des 
schizophrénes donne une bonne approximation aux effets pathologiques que 
nous cherchons, et, effectivement, les données sur les schizophrénes, réunies par 
Zipf et Whitehorn et par Baker (1950) donnent, 4 notre connaissance, les meil- 
leures vérifications de la loi canonique et de ses conséquences relatives a la 
variation du nombre de mots différents lorsqu’on augmente le nombre de mots 
total. 

A l’autre extrémité, on a un autre cas pathologique, le Basic English, que 
personne ne parle automatiquement, et qui est une traduction de |’anglais 
véritable, fait par petits groupes de mots. On nous a communiqué oralement 
que les textes en Basic English ne satisfont pas & la loi canonique. 

Considérons enfin les langues artificielles, telles que l’esperanto. Ces langues 
s’efforcent d’atteindre un trés grande régularité dans |’emploi des formes, et 
leurs constructeurs ne se préoccupent bien entendu pas de statistique. I] serait 
intéressant de voir si ces langues n’atteignent pas, 4 l’usage, un état ov les 
qualités qui avaient motivé leur synthése se perdent au profit de qualités plus 
pragmatiques. Pour le moment, nous ne connaissons qu’un résultat qu’on nous 
a@ communiqué oralement: les textes en esperanto ne satisfont pas non plus & 
la loi canonique. 
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L’ADAPTATION DU DECODEUR AU MESSAGE A-T-ELLE UNE REALITE PHYSIOLOGIQUE? 


Jusqu’ici, nous n’avons vérifié que les conséguences de la théorie. On peut étre 
tenté de voir maintenant si les étapes intermédiaires de la théorie ont également 
un sens physique, ou si ce sont des créations de |’esprit. I] est tout d’abord clair 
que ces étapes ne peuvent pas avoir foutes un sens physique, car on peut con- 
struire toutes sortes de légéres variantes de la théorie, qui restent structurale- 
ment identiques 4 la méthode que nous avons adoptée, mais en différent dans 
le détail. I] faut donc se limiter a priori aux étapes communes 4 toutes les théories. 

Les faits essentiels sont que la variété-information de Shannon a un sens 
physique, et qu’il existe quelque part un codage dans lequel le coft du code d’un 
mot est proportionnel & la fois 4 la fréquence du mot et & son rang dans le classe- 
ment des mots par ordre de fréquences décroissantes. On peut essayer de vérifier 
ce deuxiéme fait. 

On constate aisément que le nombre de lettres ou de phonémes ne peut pas 
étre le coit de cette théorie. Nous avons déja mentionné ce fait et nous voyons 
maintenant qu’il n’a rien de surprenant. En effet, méme si la prononciation, en 
tant que code, était construite de fagon “‘idéale’’, les critéres définissant cet idéal 
ne seraient pas les critéres que nous avons utilisés. En particulier, l’orthographe 
et la prononciation d’un mot doivent étre les mémes pour tous les sujets parlant 
une langue, quelque fréquent que soit l’usage du mot par le sujet, ce qui con- 
tredit notre critére. 

D’ailleurs, il aurait été surprenant qu’un principe d’économie et de compromis 
s’applique a un codage faisant intervenir des nerfs et des muscles périphériques, 
qui est certainement économiquement moins critique que les codages situés plus 
haut dans le cerveau. Malheureusement, ces derniers ne peuvent étre observés 
directement, mais tout au plus a l’aide de dispositifs expérimentaux assez com- 
pliqués. 

On peut par exemple essayer d’identifier le “‘cofit-délai” de nos raisonnements 
avec le temps nécessaire pour lire un mot écrit (qui est plus facile 4 mesurer que 
le temps nécessaire pour comprendre un mot prononcé). Effectivement, D. H. 
Howes et R. L. Solomon (1951) ont montré que ce temps dépend logarithmique- 
ment de la fréquence supposée du mot chez le sujet étudié. Mais ces expériences 
présentent, selon leurs auteurs eux-mémes, deux défauts sérieux. Tout d’abord, 
les fréquences supposées des mots n’étaient pas les fréquences réelles, mais les 
fréquences moyennes de Thorndike et Lorge, qui de plus ne sont pas prises suivant 
les principes que nous avons recommandés. D’autre part, la longueur d’un mot 
écrit est, en gros, liée 4 la rareté de ce mot. Bien que cette liaison soit insuffisante 
pour identifier le cot 4 la longueur, elle peut fort bien influer sur le temps 
nécessaire 4 la lecture d’un mot. Pour éviter ces défauts, R. L. Solomon (cf. 
G. A. Miller, 1951, p. 205 et R. L. Solomon et L. Postman, 1952) a construit des 
“langues” composées de mots dénués de sens, mais pronongables, et ayant tous 
8 lettres, que l’on présentait aux sujets avec des fréquences connues de |’expéri- 
mentateur. Ces auteurs ont trouvé que la durée de lecture est bien propor- 
tionelle 4 la fréquence et au rang du mot étudié, ce qui semble confirmer que les 
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probabilitésa priori’ et l’adaptation ont une réalité physiologique. De pius, l’adap- 
tation est susceptible d’étre réalisée rapidement ou de changer rapidement avec 
les besoins. 


DESCRIPTION MACROSCOPIQUE DES DIFFERENCES INDIVIDUELLES 


La théorie que nous venons d’exposer s’est jusqu’ici attachée aux propriétés 
communes & tous les textes. Ces propriétés peuvent étre résumées en disant qu’il 
est possible pour tout texte de choisir des “paramétres’”’ R (nombre total des 
mots du vocabulaire) et B et p, de telle fagon que la courbe canonique corres- 
pondant 4 ces paramétres, et les courbes dérivées, passent toutes par les points 
expérimentaux fournis par le texte étudié. 

Il nous faut maintenant étudier les propriétés des textes qui résultent des 
valeurs particuliéres des paramétres de ce texte. Pour reprendre une comparaison 
déja utilisée, nous avons étudié les propriétés analogues au fait qu’un individu 
peut s’habiller dans un magasin. Maintenant, nous allons étudier les propriétés 
analogues & celles qui résultent du fait que les tailles de l’individu sont 42 L 34, 
ou toutes autres tailles déterminées. 

Nous nous limiterons absolument aux propriétés qui résultent des valeurs des 
paramétres, et négligerons entiérement la langue dans laquelle on parle, sauf si 
sa structure influe sur les valeurs des paramétres. Notre description trés simpli- 
fiée sera appelée macroscopique, par analogie avec la description macroscopique 
d’un gaz au moyen de sa température, volume, pression etc. . . . Cette descrip- 
tion d’un gaz avait été introduite dans la théorie de la chaleur par opposition 
avec la description microscopique, qui est inaccessible, mais aurait consisté & 
donner sa composition chimique, et la position et la vitesse de chacune des molé- 
cules. Dans le cas de la langue, la description microscopique est constitutée par 
le texte lui-méme. Elle est donc bien accessible, mais elle est aussi d’une com- 
plication telle qu’il faut absolument simplifier, ce dont la linguistique doit pré- 
cisément se charger. Une description macroscopique n’a d’utilité que si elle 
permet de réduire considérablement le nombre de données numériques décrivant 
un texte, sans réduire de facon aussi grave la précision de la description du texte 
qui est offerte par ces données. Cette situation est réalisée ici, parce que la langue 
est un systéme. Ses propriétés ne sont pas indépendantes les unes des autres, et 
beaucoup d’entre elles dépendent de certains paramétres qui donnent au con- 
cept vague de “richesse de vocabulaire” un sens technique et intrinséque, in- 
dépendant de la méthode d’étude. 

Cependant la propriété la plus importante d’un systéme, et la plus riche de 
conséquences structurales, n’est pas toujours la plus apparente et la plus intui- 
tive. Tel est encore le cas ici, ov il se trouve que beaucoup de choses dépendent 


5 Une autre justification, tout & fait indépendante, du concept de probabilité a priori 
est offerte par l’examen de ‘‘La fille du capitaine’”’ de Pouchkine. Les collaborateurs de 
H. H. Josselson (1953) ont compté les intervalles entre les apparitions successives d’un 
certain mot, et ont trouvé exactement ce qu’il faudrait trouver si ce mot revenait au hasard 
ce qui ne signifie pas que tel soit le cas, mais seulement qu’il n’est pas contradictoire d’ad- 
mettre l’hypothése du hasard en premiére approximation. 
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du seul nombre B ou de son inverse 1/B, que nous appellerons la température 
informationnelle du texte. Grande température signifie que les mots disponibles 
sont bien employés, les mots rares étant eux-mémes utilisés avec des fréquences 
appréciables. Petite température signifie que les mots sont mal employés, les mots 
rares étant extrémement rares. On finit par se familiariser avec ce concept, mais 
on peut s’étonner a priori que ce soit la le concept central de la théorie, au lieu 
du seul de nos paramétres qui paraisse avoir un sens intuitif: le nombre de mots 
différents R. Toutefois, 4 la réflexion, on voit qu’il est bon que R ne joue pas un 
réle central. En effet, R est un nombre potentiel, c’est-A-dire qu’il faudrait un 
texte de longueur infinie pour le mesurer directement; dans le cas de textes de 
longueur finie, on doit |’estimer par |’intermédiaire d’autres nombres directe- 
ment mesurables. Au contraire, B est aisément mesuré comme pente de la courbe 
rang-fréquence en coordonnées bi-logarithmiques, et il en est de méme de la 
variété-information. 

L’apparence qualitative des textes est nettement différente selon que la tem- 
pérature est plus petite que 1, ou plus grande que 1. 

Si la température est plus petite que 1, ce qui est le cas dans la grande majorité 
des cas, on dira par définition (définition due 4 Zipf) que le texte est ouvert. Ce 
terme est di au fait qu’on peut démontrer que, dans ce cas, le nombre R ne peut 
absolument pas étre estimé de fagon tant soit peu exacte. Le nombre potentiel 
de mots différents doit étre considéré comme indéterminé, parce qu’indifférent. 
En pratique, on peut le considérer comme extrémement grand, beaucoup plus 
grand que ne |’aurait strictement exigé la variété d’expression qu’on utilise 
effectivement. 

Pour les textes ouverts, la variété croit en général avec la température. Par 
exemple, James Joyce, qui a une extréme variété, a aussi un B trés voisin de 1. 
Cet exemple a méme été un mauvais guide pour Zipf, car cet auteur considérait 
Joyce comme étant le meilleur échantillon 4 sa disposition, & cause de sa longueur 
et de sa variété, et prenait B = 1 comme étant la meilleur estimation de B pour 
tout auteur, tandis qu’en fait cette valeur est due 4 la variété exceptionnelle- 
ment grande du texte de Joyce. A |’autre extrémité, les schizophrénes ont une 
trés faible température (B varie de 1.4 4 1.6). (De plus, ainsi qu’on I’a fait re- 
marquer p. 21, la canonicité est trés rapidement atteinte). Certains enfants ont 
également une faible température, et en général plus faible pour les enfants dont 
le I.Q. est plus petit. (Ceci peut bien signifier que le I.Q. considéré est une mesure 
de verbalisation plus que d’intelligence.) Les valeurs typiques pour les auteurs 
anglais considérés vont de B = 1.14 B = 1.3. 

Ainsi les variations individuelles sont trés grandes parmi les textes d’une méme 
langue. Elles sont sans doute tout aussi grandes dans les autres langues. I] 
faudrait plus de données que nous n’en avons pour pouvoir dire que les textes 
de telle langue ont, en général, une température plus grande que ceux de telle 
autre langue. 

Si la température est plus grande que 1, ce qui ne se trouve que dans des cas 
exceptionnels, on dira par définition que le texte est fermé. Ce terme est dd au 
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fait qu’on peut démontrer dans ce cas que le nombre R (nombre potentiel de 
mots différents) peut étre estimé de fagon exacte 4 partir des données rang-fré- 
quence. Cette situation est due au fait que les mots rares sont employés rela- 
tivement plus fréquement que dans le cas ouvert; et ce dernier fait signifie 4 
son tour que le vocabulaire disponible n’est pas aussi surabondant que dans le 
cas des textes ouverts. 

Effectivement, les seuls exemples de textes 4 mots de temperature supérieure 
& 1 sont l’hébreu moderne vers 1935, certains poétes trés puristes et enfin le latin 
dans la prose de Notker et l’anglais dans le Pennsylvania Dutch. 

Dans le premier cas, le vocabulaire était limité pour des raisons historiques, 
tandis que la variété avec laquelle on désirait utiliser ce vocabulaire était grande. 
Dans ie deuxiéme czs, le vocabulaire était intentionnellement limité. Dans les 
deux autres cas, il s’agit de mots d’emprunt, peu nombreux par définition pour- 
rait-on dire, qui se trouvent incorporés dans des textes allemands (qui, eux, ont 
une température inférieure 4 1). On voit donc par ]’exemple qu’un mélange de 
deux vocabulaires de températures différentes peut étre stable, 4 |’échelle de la 
vie de l’individu. Mais a la longue, il se produit un mélange intime, comme en 
anglais, et la température s’>homogénéise. I] serait intéressant de voir si l’hy- 
pothése de mélange peut expliquer les données de Zipf relatives au yiddish, qui 
constituent la seule exception visible 4 la canonicité des mots (et ot les diverses 
composantes ont été considérés simultanément dans le décompte que nous avons 
pu observer). 

I] serait également intéressant d’éprouver notre explication des cas de tempéra- 
ture supérieure 4 1, en étudiant de fagon comparée des textes authentiquement 
populaires ou enfantins, ot la température devrait étre inférieure 4 1, avec des 
textes écrits par des adultes s’astreignant 4 n’employer qu’un vocabulaire re- 
streint. Méme intérét pour la langue des aphasiques. 

Remarque sur les vocabulaires mélangés. La théorie qui précéde ne s’applique 
qu’aux vocabulaires individuels. Cependant, les différences entre individus sont 
relativement légéres si on ne considére que des textes d’adultes normaux. Par 
suite, les propriétés de mélanges de textes normaux restent assez voisines des 
propriétés de chacun des individus. En particulier la statistique reste assez 
voisine de la canonicité. 

Autres différences individuelles. Notre objet n’est pas de décrire les différences 
pour elles-mémes, méme si elles sont descriptibles de fagon statistique. I] nous 
suffra donc de signaler les travaux de Udny Yule (1945) et de ses successeurs, 
qui ont pour but de dégager des indices statistiques, aussi variables que possible 
d’un auteur & |’autre, et par suite susceptibles de servir 4 la reconnaissance de la 
paternité des textes. Pour cela, Yule compte simultanément toutes les formes 
grammaticales de chaque unité du lexique, et séparément les diverses parties du 
discours. I] en résulte que ses résultats ne sont pas absolument comparables avec 
ceux de Zipf, qui servent de base 4 la présente théorie, et sont méme incapables 
de contribuer 4 étendre cette théorie 4 des nouveaux phénoménes. 
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IV. CONCLUSION 


La possibilité d’appliquer les concepts de la théorie des communications a 
des problémes non électrotechniques constitue le principal postulat de la cyber- 
nétique de Norbert Wiener (1948). Nous avons, nous |’espérons, contribué 4 
prouver par l’exemple que ce postulat est justifié. Que cette démonstration ait pu 
étre faite en linguistique avant d’étre faite pour les autres sciences sociales n’a 
d’ailleurs rien de surprenant, car c’est dans ce domaine que |’on posséde les 
suites numériques les plus nombreuses, et que les observations sont le moins 
perturbées par |’observateur (cf. Lévi-Strauss, 1951). 

Aprés Vinfluence de Wiener, il nous faut signaler |’influence posthume que 
Saussure et Zipf ont exercée sur ce travail. La théorie du premier fournit un 
cadre trés élégant et trés géométrique, et, bien que nous en différions trés souvent 
dans nos conclusions, les problémes qu’il pose ont souvent un sens dans le cadre 
de la théorie informationnelle, ce qui est loin d’étre le cas pour la plupart des 
traités linguistiques. De méme Zipf est trahi par les insuffisances de sa méthode 
et aboutit quelquefois & des points de vue incohérents et absurdes. Mais ses 
écrits posent clairement le probléme des relations des structures linguistiques 
avec des éléments appartenant & des sciences plus fondamentales dans la classi- 
fication d’A. Comte. 

Les données expérimentales connues sont encore trop concentrées sur les textes 
anglais, 4 cause de la nationalité des chercheurs et de la facilité de l’étude de 
cette langue. Nous souhaitons que les décomptes normaux et pathologiques se 
multiplient, et que de nouvelles propriétés soient étudiées numériquement. Ainsi 
le chapitre “linguistique statistique’ de la nouvelle physique de la langue se 
développerait et s’étofferait, et le jour approcherait ot |’étude de la langue pas- 
sera définivement la barriére fictive qui l’a jusqu’ici séparée des sciences exactes. 
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English Summary 
FORMAL STRUCTURE OF TEXTS AND COMMUNICATION 


From the experience that one has of speech organs, it could seem that it fol- 
lows that the physics and the physiology of communication have no influence 
upon the deeper and more general structural properties of language. We propose 
to prove that, in two respects at the least, this is not in fact the case, and that 
the properties of speech turn out to be those which would be attributed to it if 
language were reconstructed from scratch, account being taken only of the 
pragmatic conditions of communication, and not of its aims. 

A) The fact that the acoustic or pictural carriers of speech are continuous 
physical objects, susceptible to disturbance by noise, requires that there exist 
also another aspect to speech: a discrete message. We shall also derive in this 
way other properties of language. 

B) The statistics of words, considered as empty forms, collected by Estoup, 
Chotlos, Baker, Zipf, Josselson, et al.,in widely varied languages and authors, 
can be explained by assuming that the discrete messages due to each individual 
author are built out of “concrete units” identical with words, and that the statistics 
of these units are, intentionally or otherwise, those which transmit the greatest 
amount of information (in a technical sense used by Claude Shannon), compatible 
with a given complication of the receiving apparatus, expressed by an average 
cost of decoding a word, by an ideal decoder. A summary of our mathematical 
theory has been given by Miller (1954). 

These two properties seem to be a beginning of confirmation of an axiom of 
cybernetics. 


Research Laboratory of Electronics 
Massachusetts Institute of Technology 








PHONEME CLASSES AND PHONEME CLASSIFICATION 


Hans Vocr 


The phonemic description of a language is not complete if it confines itself to 
establishing the phoneme inventory and to studying the distinctive features of 
its members. It should also comprise a description of the phoneme combinations 
that occur or may be expected to occur in the language. It is generally assumed 
that these combinations conform to certain patterns varying from one language 
to another, and that it is possible to formulate a series of more or less general 
statements which account for these patterns, provided the material at our dis- 
posal is rot too small. According to the theoretical approach of the linguist or 
the term: aological framework of his description, this part of phonemics has 
been termed distributional, relational, or functional phonemics. In a paper 
recently published! the Danish linguist Eli Fischer-Jgrgensen has made a critical 
examination of some of the distributional analyses that have been published, 
of languages such as Polish, American English, Ancient Greek, Norwegian, 
some American Indian languages, and a few others, and 2s a conclusion she has 
tried to outline a general procedure for phoneme classification on a distributional 
basis which should, in principle, be applicable to all languages. Assuming that 
the phonemes of a given language have been classified in the fundamental 
classes of vowels and consonants, the members of these can be further classified 
in smaller classes. As regards the consonants, the combinations of which are more 
varied than those of the vowels, they can, following this procedure, be classified 
according to their occurrence in initial and/or final position, their occurrence in 
clusters initially and/or finally, and according to their position as first or second 
term in these clusters. Because in many languages ciusters of two consonants are 
more numerous than those of three or more, it will in most cases not be neces- 
sary to go beyond the clusters of two consonants in order to obtain a satisfactory 
classification. Some linguists have even maintained that the study of larger 
clusters will not yield any new significant results because they can all be dis- 
solved into clusters of two consonants—thus C, C2 C; can be dissolved into 
C, C2 and C, C;. The author of the paper mentioned has, however, shown that 
this statement does not hold universally, and she recommends taking account 
of the larger clusters wherever the study of the smaller ones does not prove 
sufficient. 

There are, however, languages where the situation is rather different from 
that prevailing in the languages which Miss Fischer-Jgrgensen has in view. 
Such is Georgian, one of the South Caucasian languages and the best known of 
them, with a considerable literature of more than a thousand years’ standing. 
The difference is not due to the number of phonemes, which is not extraordinary : 
5 vowels and 28 consonants (in Old Georgian, 31). The difference is above all due 
to the bewildering variety of the consonant clusters. With the exception of h, 


1 “The Definition of Phoneme Categories on a Distributional Basis,’”’ Acta Linguistica 
7.8-39 (1952). 
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which only occurs initially before vowels and thus never enters into any cluster, 
the consonants combine in clusters of two, three, and four, even of five and six 
terms, according to patterns unknown in the more familiar European languages. 
It may be noted, as a curiosity, that the most common type of clusters of three 
consonants in the Germanic languages, viz. s- followed by stop + sonant, is a 
rare type in Georgian, occurring chiefly in loanwords of Indo-European origin, 
and even “simple’’ clusters like sf- and &-, which appear in Old Georgian, have 
been transformed in the course of time into rt- and ré- —a phonetic development 
which could serve as a warning against hasty generalizations about ‘economy of 
effort” and similar catchwords by which some linguistic schools have tried to 
explain all regular phonetic changes. 

The total number of initial clusters in Georgian, as they occur in ordinary 
speech and in printed texts, runs into the thousands, and that of final clusters 
into the hundreds. The number of theoretically possible medial clusters should 
run into hundreds of thousands. It is of little interest to try to count them 
because they can apparently all be interpreted as the sum of a final and an 
initial cluster. If we limit ourselves to initial clusters, we shall discover, fortu- 
nately, that the greater part of them can be discarded as irrelevant to our pur- 
pose, viz. all those which are analyzable into a consonantal prefix preceding a 
stem with initial cluster. The use of the prefixes is, as a matter of fact, inde- 
pendent of any phonemic factor. All unanalyzable initial clusters may be pre- 
ceded by such prefixes, and the analysis of the resulting clusters does not yield 
anything from a structural point of view. Nevertheless, the number of remaining 
unanalyzable clusters, where no prefixes can be isolated, is considerable. My 
material offers the following data: 


Number of consonants in the cluster.............. 3 3 4 5 6 
Number of different clusters...................... 231 332 144 21 4 


with a: total of 732 initial clusters, all phonemically distinct.2 The number of 
clusters of three consonants thus by far exceeds the number of clusters of two, 
and although the number of still larger clusters decreases rapidly, the clusters of 
four cosenants and more still constitutes nearly a fourth of the total. 

If the 26 consonants combined freely with no other restriction of distribution 
than the limited number of terms in any cluster, the number of possible clusters 
would be staggering indeed (abov . 223 million). The actually occurring clusters 
represent only a small selection of these possible combinations. If we study the 
occurring clusters, we discover that this selection is not a random one but that 
it is based on certain preferred patterns of structure. The great majority of the 
clusters fall, as a matter of fact, into a limited although relatively large number 
of types. But if we limit ourselves, in our analysis, to the clusters of two conso- 
nants, the procedure outlined above will yield no significant results for the very 


? This figure is based upon my own interpretation of the facts. According to another 
possible interpretation this number could be reduced by some 20 or 30 units. I hope to be 
able to justify my interpretation in a forthcoming paper where I shall deal with the Geor- 
gian phonemic system in greater detail. 





30 HANS VOGT 


simple reason that all the consonants, always excepting h, occur both initially 
and finally, and all of them occur in clusters, initial and final, and all of them 
occur in the first and second position. There is apparently one exception: the 
phoneme Z does not seem to occur in the second position in final clusters, as 
opposed to s, §, and z, which occur finally in both positions. But as 2 is by far 
the rarest phoneme in the language, this gap may be purely accidental. Even if 
this gap should be structurally determined, we would only have succeeded in 
establishing two classes of consonants: the phoneme Z on one hand and all the 
other consonants on the other. The clusters of two consonants are thus not 
sufficient for a classification. It would therefore seem reasonable to take the 
larger clusters into account and try to work out the classification on this new 
basis. Such a procedure is, however, not practicable in our case for reasons 
which will be examined below. 

Although my material is drawn from a fairly extensive corpus—all accessible 
dictionaries and vocabularies, printed texts of tens of thousands of pages as weil 
as ordinary speech—there is every reason to believe, as experience has shown, 
that additional material would yield new clusters. The material will never be 
complete. It will always contain accidental gaps, partly because some occurring 
clusters may have escaped my attention, partly because some clusters by pure 
chance do not occur in the vocabulary. On the other hand, the patterns exhibited 
by the occurring clusters are extremely varied and often rather complicated and 
difficult to recognize, especially in the case of clusters containing two or three 
stops of which two may differ as regards voicedness, aspiration, and glottaliza- 
tion. It is therefore to be expected that the proportion of accidental gaps will 
increase with the number of terms of the cluster. In the class of clusters of two 
consonants there seem to be very few such gaps, in the clusters of three there are 
considerably more, and in the still larger clusters probably still more. It is 
therefore of little or no avail to resort to the larger clusters in order to arrive 
at an adequate classification, because we have no means of deciding whether 
such gaps or lacks of symmetry are due to pure chance or are determined by the 
structure of the language itself. 

How to distinguish between structural law (or rule) and chance distribution 
is therefore in Georgian the fundamental problem, and without at least a tenta- 
tive solution it would be difficult and probably impossible to give a coherent and 
consistent picture of the structural patterns of the Georgian consonant clusters. 
My thesis is that such a picture can only be drawn by taking into account not 
merely the actually occurring clusters, but also clusters which I propose to call 
virtual members of the system. In order to make clear the meaning of this term, 
an example from Danish mentioned by Miss Fischer-Jgrgensen and supple- 
mented from Norwegian material will serve as an illustration. 

In Old Norse no word is known with the initial cluster skl-, and even nowadays 
it appears in Danish only in obvious loanwords like sklerose. It may be assumed 
with some reason that such a foreign word has been assimilated in the modern 
vocabulary because its phonemic structure, including the structure of the initial 
cluster, did not conflict with the already existing cluster patterns. The cluster 
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skl- of sklerose eliminates a previous asymmetry in the system by “filling in” a 
striking gap. This assumption seems to be corroborated by the Norwegian data 
which on one point, however, offer an interesting difference. The Old Norse 
skrida has in many dialects been kept with unchanged initial cluster, but in 
other dialects as well as in Standard Norwegian (of the Riksmaal variety) it 
has been changed into skli ‘to slide’. In this sole morpheme with its derivatives, 
the initial cluster skl- has emerged within the language itself, with the effect 
that the introduction of foreign elements such as sklerose can have caused no 
phonemic difficulties. The cause of the phonetic mutation skr- > skl- seems 
perfectly clear: we have to do with a typical case of phonetic symbolism, due 
to the existence of numerous words, semantically related and containing initial 
clusters with / as their last term (kl-, gl-, sl-). What is remarkable is not this 
rather common phonetic change but the fact that it has given rise to a cluster 
previously unknown in the language. This can only have happened because this 
previously unknown cluster did not, by its emergence, change any already 
existing patterns. I propose to call virtual a cluster which does not occur in the 
corpus of texts but whose emergence would not change the patterns exhibited 
by the actually occurring clusters—a cluster whose existence is, so to speak, 
implied by these patterns. In other words, it is a cluster which can be expected 
to appear in one or more words, if the corpus is infinitely extended. According 
to this terminological convention the consonant cluster system would have to be 
established on the basis of the sum of the actually occurring clusters and the non- 
occurring virtual clusters. This distinction between actual and virtual members 
of a system may have no great interest in languages such as English, Danish, 
and Norwegian, where the consonant clusters are comparatively few and rela- 
tively symmetrical in their arrangement. In Norwegian such clusters as gv- and 
skn- might probably be included in the class of virtual clusters. But in Georgian 
this distinction is of paramount importance, because the number of virtual 
clusters probably exceeds that of the actually occurring clusters. The combina- 
tory possibilities of consonants in the Georgian language, as determined by the 
phonemic structure of this language, have only partly been exploited in the 
vocabulary. 

According to what has been said above, the extension of the class of virtual 
clusters will depend on what we understand by the structure of the system. 
Without trying to give a formal definition of the term structure, I shall here 
understand by the term the totality of distributional rules that can be formu- 
lated on the basis of the occurring clusters, and by structural rule I shall under- 
stand a statement concerning the distribution of phonemes in which at least one 
of the terms brought into relation by the statement is a class of phonemes. The 
statement that “b does not occur before k” is no such structural rule of distri- 
bution, because the two terms of the statement, b and k, are individual phonemes. 
The statement that ‘‘b does not occur before unvoiced stops” would on the other 
hand be considered a distributional rule, since the term unvoiced stops designates 
a class of phonemes. A description of the structure of the consonant cluster 
system is thus based upon the establishment of phoneme classes. If the phoneme 
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classes upon which the structural statements are based are defined in a purely 
relational way, the description will be a purely relational (distributional, func- 
tional) one; if the phoneme classes are defined on the basis of common phonetic 
features, the description will be a mixed relational-phonetic one. 

If we disregard completely the phonetic substance of the phonemic units, we 
can classify the phonemes by their relations to each of the phonemes in the 
language. Let us, for Georgian, take the consonant v as an arbitrary starting 
point. By its occurrence and position in initial clusters of two consonants we 
can define four phoneme classes, of which two and two are mutually exclusive. 
In fact, the consonant v can follow all consonants except b p p m n and v. These 
six consonants will form one class, negatively defined; the remaining 21 conso- 
nants will form another class, positively defined, and the two classes are mutually 
exclusive by definition. On the other hand, the consonant v can precede initially 
sr land n, and it is thus possible to constitute on this criterion another pair of 
mutually exclusive phoneme classes. Taking in this way one phoneme after 
another, we can establish 27 X 4 = 104classes, or 54 pairs of mutually exclusive 
classes. No two of these classes are, as‘a matter of fact, identical. New sets of 
classes can be established by those consonants which have two or more functions 
in common, e.g. the class of consonants which in initial position can precede and 
follow v, viz. s, r and l, and by using the clusters of two consonants as classification 
criteria instead of one single phoneme, new sets of classes can be defined. Thus 
the cluster cr precedes r 1 m n v and d in initial clusters of three consonants, 
and follows p k r and m in such clusters. An almost unlimited number ot classes, 
partly mutually exclusive, partly overlapping, may thus be set up on the basis 
of the actually occurring clusters. But without taking into account the phonetic 
features of the phonemes one has no means of deciding—or even of guessing— 
which of these purely relational «asses have any structural relevancy and which 
of them are due to pure chance. Here there is a difference between Georgian 
and languages such as English and Danish or Norwegian—in the latter ones 
there will be close correspondence between phonetically established classes and 
the purely relational ones, because the clusters are relatively few, and because 
the accidental gaps are rare and easy to recognize. If, however, in the case of 
Georgian, we accept as given the phonetic classification of the phonemes, we 
can make more or less hypothetical decisions of structural relevancy. If we 
consider that b p and 7 are labial stops and that the consonant v also contains 
a labial feature, we may presume that the non-occurrence of the clusters bv- pu- 
and pv- is structurally determined. On the other hand we presume that the non- 
occurrence of §n- is purely accidental, because we know that all other stops and 
affricates occur initially before n. We may even think that this gap is due the 
lesser frequency of the phoneme 3. Following the terminology suggested above, 
we would say that bv-, which does not occur, is also not a virtual member of the 
system, because we can formulate the rule: labial stops do not occur before »v. 
But we would consider the cluster gn- as a virtual member of the system because 
we can formulate no rule of the same kind. There are, it is true, other phonemes 
which share with § this negative function of not occuring before n, namely 
p plr and Z, but we cannot set up these phonemes as a class defined either 
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phonetically or by some other functional criterion. The statement: p p 1 nr Zz 
and 3 do not occur before n in initial clusters, is therefore no true structural 
statement, but only a convenient way of summarizing seven individual state- 
ments, viz. p does not occur before n, » does not occur before n, and so on— 
statements of which each may refer to purely accidental facts or to structurally 
determined ones. Only by combining distributional and phonetic f: cts is it 
possible to arrive at a phoneme classification which is relevant to the description 
of the consonant cluster system. 

There are border cases where it is difficult to decide. The labial stops do not 
occur before v, as already mentioned, but b occurs in some rare cases before 
the nasals m and n, whereas the two other labial stops p and p do not occur in 
the same clusters. We could therefore formulate the rule: the unvoiced labial 
stops do not occur before nasals—a statement which has all the formal charac- 
teristics of a structural rule, since the two terms it relates are both phoneme 
classes. If one hesitates in accepting it as a true structural rule, it is because the 
distinction between voiced and unvoiced stops in no other cases seems to be 
relevant before sonants. The simplest way of eliminating this problem would 
be to relegate the clusters bm- and bn- from the system—taking the term system 
in a narrower sense than is usually done. These clusters would belong to a small 
class of clusters which do occur but which do not conform to the structural 
patterns exhibited by all the other clusters. As a matter of fact, in a language 
such as Georgian with consonant clusters of varying types, morphological 
processes may be expected to occasion the emergence of clusters which exhibit 
unusual structures. These clusters constitute marginal elements, fringes of the 
system, as opposed to the vast majority of clusters which constitute the basic 
system. We would thus by definition have established four classes of clusters 
for our descriptive purposes: the class of clusters which do not occur and which 
do not conform to any structural patterns in the language, called inadmissible 
clusters; the class of clusters which occur but do not conform to any structural 
patterns, called marginal clusters; the class of clusters which do not occur, but 
conform to the structural patterns of the language, called virtual clusters; and 
finally the class of clusters which occur and conform to certain patterns, called 
actual clusters. 

There are in Georgian clear cases of conflict between phonemic tendencies 
and morphophonemic rules which give rise to such marginal clusters. In Old 
Georgian there is a whole series of verbs which oppose * so-called strong aorist 
characterized by the absence of any tense suffix to a secondary present with the 
suffix -av: v-kal ‘I killed him’ and v-h-kl-av ‘I kill him’. The infinitive of such 
verbs is formed from the present stem with the suffix -a: kl-av-a > klva. The 
sequence -ly- is, however, inadmissible in modern Georgian, and is therefore 
eliminated by metathesis: klva > kvla. The form kvla is the modern form which 
with its sequence -vl- is perfectly conformable to the modern consonant cluster 
patterns. In one verb, however, the old infinitive has been kept unchanged, 
viz. plva ‘to inhumate’ (present v-pl-av, aor. v-pal). The reason for this unique 
treatment of the sequence -lv- in this verb is obvious: the metathesis which 
would eliminate it would on the other hand have given rise to the initial sequence 
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pu-, & sequence which is absolutely inadmissible. The actually occurring cluster 
plv- is such a marginal cluster which is outside the basic system; it does not con- 
form to any of the patterns exhibited by other initial clusters with two sonants 
as final terms. 

The terminological framework we have tried to lay down has seemed con- 
venient for the description of the complicated Georgian system. The definition 
of the terms will in the last instance depend on the definition of the funda- 
mental concepts of system and structure, and as all these definitions are inter- 
dependent, we will be moving in a vicious circle, and it is hard to see how you 
can break this circle without relying on extra-linguistic criteria on one point, 
arbitrarily chosen. The content of the term system will e.g. to a large extent 
depend on how we delimit our corpus of texts and how we interpret elements 
such as loanwords, dialect forms and foreign words which it contains. One can 
choose to relegate them from the system, because one has from the start some 
preconceived notion of the nature of the system, or one can include them in the 
system with the result that the description will be extremely complicated. 
From a purely empirical point of view the initial cluster ps- belongs to the 
Norwegian and Danish system, since it occurs in perfectly familiar words such as 
psykologt, psykisk, and so on. We may accept the pattern: stop followed by s as 
belonging to the system and interpret the clusters ps- and ks- as specimens of 
this pattern, occurring in some isolated words. In any case it seems imperative 
to distinguish between clusters of this type and clusters of the far more common 
type: s followed by stop, much more important in the analysis of these languages. 
Whatever decisions the linguist makes in delimiting his material, he must state 
explicitly why he chooses to make them and what he aims at doing by making 
them. The soundness of his judgment can only be estimated in each separate 
case by the results he arrives at in his description, which is required to be ex- 
haustive, consistent and as simple as possible. Exhaustiveness and simplicity 
seem, unfortunately, quite often to exclude each other. 

The main conclusion I would like to draw from this sketch of some of the 
difficulties one has to cope with in the study of Georgian, is that a purely distri- 
butional description of the phonemic system which accounts for all occurring 
clusters and at the same time allows us to predict what clusters may be expected 
if the corpus is infinitely extended, is in some cases impossible without regard 
to the phonetic classification of phonemes. A phonetic classification based on 
objective acoustic criteria seems nowadays a possibility within a not too distant 
future. This does not mean that all significant phonetic features are relevant to 
the structure of the phoneme combinations. The phonetically relevant difference 
between stops and affricates, between s-sounds and §-sounds seems thus in 
Georgian to be of practically no importance for the description of the cluster 
patterns. Which phonetic features have a structural relevancy in this respect 
and which have none, is a problem which has to be solved for each language. 
Acoustic analysis and phonemic interpretation supplement each other—the 
latter will often prove impossible without the former and one may doubt whether 
the former can be done adequately without the latter. 
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A LOCUTIONAL CONTAMINATION 
Emitio PERvzzI 


The expression “‘to save appearances” exists today in most European lan- 
guages: e.g. Engl. to save appearances, Fr. sauver les apparences, Ital. salvare le 
apparenze, Span. salvar las apariencias, Port. salvar as aparéncias, Cat. salvar 
les aparences, Lad. salvar l’apparentscha, Rum. a salvd aparenjele, Germ. den 
Schein retten, Dutch den schijn redden, Norw. redde skinnet, Pol. zachowaé 
pozory, Hung. fenntartja a ldtszatot, MGk. sézein ta proskhémata, Alb. mé péshtue 
dukén, Turk. zavahiri kurtarmak, etc. (only a few tongues usually have recourse 
to such expressions as “‘to keep up reputation” and the like, e.g. Russ. sobljudat’ 
prilidija, Czech zachovati slusnost). 

No doubt we have here a locution which may be defined as European to the 
fullest extent of this term, i.e. by origin, area and mode of diffusion (see Saggi 
di linguist. eur. I, Salamanca 1954, p. 1-15), and which has got into current us- 
age from the scientific language. 

Its most remote antecedent is the Aristotelian sézein tén hupéthesin (4th 
cent. B.C.), De Caelo III, 7 (306a, 26-30): pros dé tovitois andnkhé mé pan s6ma 
légein diairetén, alla mdkhesthai tats akribestdtais epistémais; hai mén gar kai 
to noétén lambdnousi diairetén, hai mathématikat, hoi dé oudé t aisthétin hdépan 
sunkhérotsi did td bovilesthai sdzein ten hupéthesin “Besides this, they must 
assert that not every body is divisible, in contradiction to the most accurate of 
sciences, mathematics. For mathematics conceives even the intelligible as 
divisible, whereas these theorists, in their eagerness to preserve their hypothesis, 
deny the property to some of the sensible world” (transl. W. K. C. Guthrie). 

In the 4th cent. A.D., while the Aristotelian expression occurs in Themistius’ 
paraphrase of the Stagirite (In Phys. 1.2, ed. Schenkl, Comment. in Arist. Graeca 
V, Berlin 1900, p. 3, 31), Simplicius most frequently uses sézein ta phainémena 
(e.g. Comment. cit. VII, ed. Heiberg, Berlin 1894, p. 509, 13-19; p. 510, 31; 
p. 516, 13; p. 516, 24; p. 519, 10-11 etc.) and offers unquestionable evidence 
that it must have been current in astronomical writings at a much earlier 
period, perhaps even dating back to the same 4th cent. B.C. either in Eudoxos 
or in Aristotle’s disciple Eudemos, who (according to Sosigenes, from whom 
Simplicius takes it) attributed to the Cnidian scholar the credit of having 
succeeded for the first time in explaining the planetary motions by certain 
simple hypotheses (In. L. de Caelo 11.12, Comment. cit. VII, p. 488; cp. on this 
subject Hultsch, Pauly-Wissowa Realencycl. XI, col. 939): 


Kai prétos tén Hellénon Eridozos ho Knidios, his Ervidémés te en t6i deutérdi tés Astrologikés 
historias apemnéméneuse kai Sdsigénés para Eudémou totto labén, hépsastai légetai ton 
toiotiton hupothésedn Plétdnos, hés phési Sdsigénés, prébléma tofito poiésaménou tots pert 
tatita espoudakési, tindn hupotetheis6n homalén kai tetagménin kinésedn diasdthéi ta peri tas 
kinéseis t6n plandménin phainémena. 


It is apparent that the diffusion of the locution in European languages is 
due to the frequent usage of its Lat. rendering salvare apparentias and the like 
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throughout the Middle Ages. An author such as Thomas Aquinas (13th cent.) 
will write for instance: 


“Tllorum autem suppositiones, quas adinvenerunt, non est necessarium esse veras: licet 
enim talibus suppositionibus factis appareant solvere, non tamen oportet dicere has suppo- 
sitiones esse veras: quia forte secundum aliquem alium modum nondum ab hominibus 
comprehensum, apparentia circa stellas salvatur. ... Est autem ulterius sciendum, quod 
secundum suppositiones Eudoxi non poterat omnis apparentia circa stellas salvari’’ (De 
Caelo et Mundo II, 17, Opera XIX, New York 1949, p. 120). 


St. Thomas, then, attributed to the laws of physical science not the value 
of a thesis but that of a hypothesis, acceptable in so far as the law is consistent 
with the phenomena; which does not imply, thus, that no other theory may be 
advanced, by which the same phenomena could be satisfactorily explained, 
i.e. the same appearances could be saved. Though neither Aristotie nor his 
ancient and medieval successors made a clearcut distinction between philosophy 
of nature and natural sciences, such a distinction is preceded by what they said 
on saving the appearances. ‘Both philosophy of nature and natural sciences 
begin with phenomena which are sensibly observable, since all our knowledge 
has its origin in sense perception. But the philosophy of nature goes from the 
observable phenomena to the real essences or causes which underlie and account 
for the phenomena, while natural science goes from the observable phenomena 
to a formula expressing some regular order in the occurrences of the phenomena 
and, eventually, through the application of this formula, to the prediction of 
future phenomena” (Br. Benignus, Nature, Knowledge, and God, Milwaukee 
1947, p. 413). In other words, “scientific laws, properly formulated, never 
‘explain’ anything. They merely state, in an abbreviated and generalized form, 
what happens. No scientist ... knows why anything happens, or can ‘explain’ 
anything. Scientific laws do nothing except state the brute fact that ‘when A 
happens, B always happens too’”’ (W. T. Stace, Philosophy, IX, 1934, No. 33, 
p. 47). An inquiry into the exact meaning of the locution in technical language, 
as connected with the evolution of views about physical theory, may be dis- 
pensed with here, since it has no bearing on the development of its figurative 
acceptation (therefore it will suffice to refer to P. Duhem, Sézein ta phainémena, 
Essai sur la notion de théorie physique de Platon a Galilée, Paris 1908). It is 
apparent that it continued to be used in scientific Lat. and in the national 
languages of modern Europe, where it eventually developed its figurative 
acceptation, and the evidence so far available suggests that this change probably 
occurred during the 16th cent. 

The earliest example I know dates back to 1578: “‘purché vi sia qualche poco 
colore di onesta, che basti a salvare l’apparenza” (F. Sansovino, Propositioni 
overo considerationi etc., publ. 1583). The wording of the passage, however, is 
such that the expression must have been current in Italian at that time; which 
is substantiated by an evidence Sarpi offers for the year 1547: 


**. .. quando li decreti furono visti, pid diede da dire quello della fede, qual conveniva 
leggere e releggere molto attentamente e specularci anco sopra, non potendosi intendere 
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senza una perfetta cognizione delli moti interiori dell’animo, e serza saper in quali egli 
sia attivo e in quali passivo . . . Fu da alcuni faceti detto che se li astrologi, non sapendo 
le vere cause de’ moti celesti, per salvar le apparenze hanno dato in eccentrici ed epicicli, 
non era maraviglia se, volendo salvare le apparenze de’ moti sopracelesti, si dava in eccen- 
tricit&é de opinioni’”’ (P. Sarpi, [storia del Concilio Tridentino II, cap. 7, ed. Gambarin I, 
Bari 1935, p. 363). 


At the Tridentine Council the expression was thus used both in its technical 
and figurative meaning, and Sarpi ostensibly grasps them, since he reports 
what ‘fu da alcuni faceti detto” and certainly plays upon the double value, 
both technical and figurative, of “‘si dava in eccentricitaé de opinioni’’. 

The first Engl. example I have found occurs in N. Brent’s translation of 
Sarpi’s text: 


‘‘Some pleasant wits said, that if the Astrologers, not knowing the true causes of celestiall 
motions, to salue the appearances, haue inuented Eccentriques, and Epicicles, it was no 
wonder if the Councel, desiring to salue the appearances of the supercelestiall motions, 
did fall into ezxcentricitie of opinions’? (N. Brent, The Historie of the Councel of Trent, 
London 1620, p. 227). 


It is probable that Brent did not feel the wit of Sarpi’s text and took it in its 
literal meaning only (note e.g. ‘“‘haue inuented eccentriques” to render “hanno 
dato in eccentrici”), which would suggest, in turn, that to save appearances (fig.) 
was not known or at least not common in Engl. at the beginning of the 17th 
cent. To this effect cp. A. Newton’s translation of Sarpi’s passage, published at 
London in 1620, where the double meanings remain quite unheeded and are 
thus lost in the renderings “ad similitudinem veri conseruandam” and “ad 
conseruanda vestigia motuum supracoelestium, in opinionum excentricitatem 
inciderint” (P. Sarpi, Hist. Conc. Trident., nova ed., Lipsiae 1699, p. 382-383). 
One may reasonably guess, however, that to save appearances in its technical 
acceptation occurred in Eng!. at dates earlier than 1620, as is suggested by the 
earlier usage of to salve ‘‘to save’’ and thence ‘‘to clear up, to explain, to account 
for’’ (see the Ozford Dict. VIII, p. 71 s.v.) and that such an expression developed 
its figurative meaning about the epoch of the aforesaid translations, in view of 
the appearance of the figurative usage of eccentric about 1630 (see the Ozford 
Dict. III, p. 28 s.v.). The same conclusion is pointed to by Bacon’s report, prob- 
ably derived from Brent’s passage, that 


“Tt was grauely said, by some of the Prelates, in the Councell of Trent, where the doc- 
trine of the Schoolemen bare great Sway; That the Schoolemen were like Astronomers, which 
did faigne Eccentricks and Epicycles, and such Engines of Orbs, to saue the Phenomena; 
though they knew, there were no such Things: And, in like manner, that the Schoolmen, had 
framed a Number of subtile and intricate Aziomes, and Theorems, to saue the practise of 
the Church” (F. Bacon, A Harmony of the Essays, ed. E. Arber, Westminster 1895, p. 345). 


Bacon’s passage, dating back to 1625, is the first instance of to save phenomena 
quoted by the Oxford Dict., and certainly is one of the earliest possible examples 
of this expression, since it was in Bacon’s age that appearance began to be 
replaced by the more technical phenomenon (see the quotations of the Ozford 
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Dict. VII, p. 772 s.v. phenomenon, VIII, p. 71 s.v. salve and p. 138 s.v. save, 
and note that this affects the scholarly Lat. of that time, too, where phaenomenon 
is readily accepted, e.g. Bacon phaenomena univers: tit. in Works, ed. J. Spedding 
e.a. III, London 1857, p. 685, Hobbes ad salvanda naiurae phaenomena hy- 
potheses sex tit. in Opera philos. Lat., ed. Molesworth I, London 1839, p. 347, 
etc.). Bacon himself offers sufficient evidence of this fact, both by his indecision 
in the spelling of this not yet naturalized word (for the general indecision of the 
17th cent. as to the spelling and plural of phenomenon see the quotations of the 
Oxford Dict. VII, p. 772 s.v.) and by its substitution for appearance in a similar 
passage. As far as Bacon’s spelling is concerned, it will be sufficient to compare 
two passages of T’he Advancement of Learning, anno 1605: 


II, 9, 1 ‘‘So we may see that the opinion of Copernicus touching the rotation of the earth, 
which astronomy itself cannot correct, because it is not repugnant to any of the phaino- 
mena, yet natural philosophy may correct’’ and II, 8, 5 “for as the same phenomena in 
astronomy are satisfied by the received astronomy of the diurnal motion’”’ [i.e. the Ptole- 
maic theory] 


which show an indecision that has been already correctly interpreted by W. A. 
Wright in his edition of The Advancement of Learning, Oxford s.d. (1868, repr. 
1920), p. 363 s.v. phenomena. As to phenomenon instead of appearance, it is 
illuminating to notice how Bacon here changed the wording of a former passage 
of his text Of the Interpretation of Nature I: 


‘‘And therefore the opinion of Copernicus in astronomy, which astronomy itself cannot 
correct because it is not repugnant to any of the appearances, yet natural philosophy doth 
correct’’. 


Likewise, Milton wavers between both words, writing in 1643 ‘to save the 
phenomenon of our Saviours answer to the Pharises” (Divorce I,1) and in 
1667 ‘“‘when they come to model Heav’n / And calculate the Starrs, how they 
will weild / The mightie frame, how build, unbuild, contrive / To save ap- 
peerances, how gird the Sphear / With Centric and Eccentric scribl’d o’re, / 
Cycle and Epicycle, Orb in Orb” (Par. Lost VIII, 79-84). Therefore, it is safe 
to assume that the first Engl. instance of to save appearances (fig.) so far known, 
which dates back to 1711 and occurs in Steele, 


‘St often happened that a duel was fought to save appearances to the world, when both 
parties were in their hearts in amity and reconciliation to each other’’ (The Spectator, 
ed. G. A. Aitken, II, London 1898, p. 80), 


is a mere example of an expression much current at that period (also in con- 
nection with the duels that Steele criticized as a general and reprovable vogue), 
but which nevertheless existed in Engl. at a rather earlier date. As a matter of 
fact, the trend to replace appearance by phenomenon in the scientific language 
and its rapid success betray the fact that the former had become unsuitable as 
a technical term; and since this had not occurred before, though the word was 
extremely common from Chaucer onwards in non-technical acceptations, such 
a substitution shall be attributed to the increasing vogue of to save appearances 
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as a figurative locution. Acceptance of scientific expressions in the common 
speech does not necessarily lead to their unsuitableness and subsequent replace- 
ment in technical usage, but it is apparent that in this case the expression, 
once it became generally known in its figurative sense, was likely to impart a 
humorous shading to the scientific texts where it appeared in its proper mean- 
ing (as may actually be seen in the witty remark Sarpi reports). 

To the same effect one may add the figurative usage of Fr. sauver les apparences 
(philos. sense Amyot 1572), that cannot have passed unheeded in England: 
Moliére, Impr. 1 (anno 1663, later Bossuet, Fénelon, Massillon etc.), cp. sauver 
les dehors Corneille, Perthar. IV, 4, 1848-1350 (anno 1652) ‘“‘Pour trouver 4 me 
perdre une ombre de justice, / Et sauver les dehors d’une adroite vertu / Dont 
aux yeux éblouis tu parois revétu! ” (cp. les dehors “l’apparence”’ in Corneille, 
Imit. J.-Christ IV, Préface 11, anno 1656, “qui d’un pain visible emprunte les 
dehors”, Oeuvres, ed. Ch. Marty-Laveaux, VIII, Paris 1862, p. 579 and see also 
X, p. 156, 24-25). 

That being stated, let us now consider the expression to save one’s face, which 
is more or less synonymous of to save appearances in current Engl. usage and has 
also got into other European languages at a very recent date, though it cannot 
be regarded as a Europeanism: Fr. sawver la face (contr. perdre la face), Ital. 
salvare la faccia (contr. perdere la faccia), both current and of Engl. origin (the 
latter probably through a Fr. intermediary), Russ. spasti (or sohranit’) lico as a 
literal translation from Fr. still confined to written usage (B. M. Volin-D. N. 
Usakov, Tolk. slov. russ. jaz. Il, Moskva 1938, col. 77, and still lacking in most 
of the recent bilingual dictionaries). 

There is no doubt that to save one’s face has been developed from such Chin. 
expressions as tiu' lien* or shih! lien® “‘to lose face” (on which see e.g. H. A. 
Giles, Chin.-Engl. Dict.?, London-Shanghai 1912, p. 895 no. 7140 and p. 1183 
no. 9742). 

“Interesting as the Chinese physiological face is,’ writes L. Yutang (My 
Country and My People'*, New York s.d., p. 199-200), “the psychological face 
makes a still more fascinating study. It is not a face that can be washed or 
shaved, but a face that can be ‘granted’ and ‘lost’ and ‘fought for’ and ‘presented 
as a gift.’ Here we arrive at the most curious point of Chinese social psychology. 
Abstract and intangible, it is yet the most delicate standard by which Chinese 
social intercourse is regulated.” 

No wonder, then, that this face and the many devices to avoid its loss have 
struck the attention of the Engl. community in China, and that the expression 
to lose one’s face has found its way into Engl. (at a moment when China was of 
deep concern to the Western powers because of her rebellion against European 
spheres of interest), first referred to the Chinese, and then loosely used whenever 
an injury is inflicted upon someone’s honour (though Chin. lien? cannot be 
rendered by honour, cp. L. Yutang, op. cit., p. 200). So a memorandum drawn 
up by the Inspector General of Customs at Peking on Jan. 23, 1876, states: 


“The country [i.e. China] begins to feel that Government consented to arrangements by 
which China has lost face; the officials have long been conscious that they are becoming 
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ridiculous in the eyes of the people, seeing that where a foreigner is concerned they can 
neither enforce a Chinese right nor redress a Chinese grievance, even on Chinese soil’’ 
(R. Hart, These from the Land of Sinim, London 1901, p. 225). 


And the Westminster Gazette writes as early as 1898 that “unquestionably 
the process of saving one’s face leads to curious results in other countries than 
China” (quoted by the Ozford Dict. IX, p. 137 s.v. save, with further instances; 
ep. Suppl., p. 354 s.v. face). 

These two quotations prove that to lose one’s face has rapidly given rise to 
the expression to save one’s face, which is by far more common than the former 
in the European languages where both locutions exist. The interesting fact, 
however, is that to save one’s face does not occur in Chin., so that it certainly 
is a development which has taken place inside Engl., and for which no explana- 
tion has been offered so far. 

It is obvious, of course, that to save one’s face has developed on the basis of 
the opposition to lose—to save, but this does not mean that such an innovation 
had necessarily to arise (the very example of Chin. is a case in point). In other 
words, one thus explains how to save one’s face developed, but not why it de- 
veloped. 

In my opinion, this locution results from a contamination of to lose one’s 
face and to save appearances (which, of course, does not imply that appearances 
might not be saved by some other hypothesis). 

It is a commonplace remark that whenever a new element gets into a linguistic 
system, speakers more or less unconsciously endeavour to fit it into the system 
by assimilating it to preéxistent elements. In the case dealt with here, same as 
the Chin. face, though by no means exactly translatable, it could not help 
being framed into the standards of Western social life which seemed more 
similar to it, and was thus assimilated to honour, dignity, and the like, the 
whole expression to lose one’s face was certainly associated with others which 
more or less conveyed identical or opposite meanings. The English face had 
long been synonymous with appearance (see Oxford Dict. IV, p. 5-6 s.v.), so that 
to save appearances in figurative usage provided the model for constructing the 
expression opposed to to lose one’s face; and to this effect it must not be over- 
looked that to save one’s face is actually felt by the average speaker as a 
synonymous locution of to save appearances, i.e. face is here assumed to indicate 
outwardness and not actual honour or dignity. Cp. e.g. Shorter Oxford Dict. I, 
p. 666 s.v. face: “‘T’o save one’s f. (fig.): to save appearances; to escape a mani- 
fest ‘slap in the f.’ ” 

It will not be too unsafe to assume that it is from Engl. that to lose and to 
save one’s face have reached other European languages, because it is Engl. 
which offers the earliest evidences of both expressions being accepted in current 
usage, whereas the sporadic occurrence of Chin. idioms with “face” literally 
translated into Western tongues before 1876 proves by the very wording of the 
contexts that they do not sound immediately intelligible to the reader. Note 
e.g. their occurrence in the writings of E.-R. Huc, a French clergyman widely 
travelled in Tartary, Tibet, and China from 1829 till 1850: Souvenirs d’un 
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voyage dans la Tartarie etc., 1850 (ed. D’Ardenne De Tizac I, p. 135): “Quand 
quelqu’un s’est compromis, on doit éviter de le faire rougir, ou, en style chinois, 
de lui enlever la face. Aprés que nos paroles eurent mis 4 couvert toutes les 
figures’ etc., cp. transl. Hazlitt (London 1852, I, p. 116) ‘‘any observation 
which may make him blush, or as the Chinese phrase it, take away his face. 
After our conciliatory address had restored self-possession to all” etc., transl. 
New York 1855 (I, p. 110) “one must avoid putting him to the blush or in 
Chinese phrase, carrying away his face. When our words had ‘covered all their 
faces’ {translator’s italics]’”’ etc. (in the case of Fr., moreover, the fact must be 
stressed that the most approximate equivalent of Chin. lien* offered by the 
traditional lexical system would be rather front than face). 

As to the fact that Engl. to save one’s face has readily penetrated into Fr., it 
can be assumed that the homography of face in both languages played, here as 
in so many other cases of Anglicisms in Fr., a decisive role in the acceptance of 
the locution. As to Ital., it is worth while being noted that this tongue not only 
commonly uses faccia in the sense of ‘‘apparenza, quel che una cosa mostra di 
essere, per lo pit contrariamente alla realta: e dicesi pit che altro di cose morali’”’ 
(Vocab. Crusca V,2, p. 460 s.v. faccia §14), but also shows at a very early date 
a locution perdere la faccia “‘to lose one’s (sense of) decency, dignity’? which 
would prove immediately clear to any present-day speaker even if he had no 
knowledge of the identical expression ultimately originated by Chin., e.g. 
“‘vedete la meretrice, che ha perduta la faccia, . . .e non si cura di parere male” 
Fra Giordano da Rivalto 14th cent. (quoted by Vocab. Crusca V,2, p. 461 s.v. 
faccia §17), “‘l’uomo ripromette del prossimo suo; e quando questi avrae perduta 
la faccia, fia abbandonato da lui’ Eccles. XXIX, 23, transl. publ. 1471 (Vulg. 
“cum perdiderit reverentiam”, quoted by Tommaseo-Bellini II,1, p. 617 s.v. 
faccia 47); therefore the acceptance of the Engl. expression produced nothing 
but a slight semantical sliding of a locution *salvare la faccia already implied 
by the Ital. linguistic system. 

In other words, Engl. to save one’s face has shown so far no great force of 
expansion and has penetrated only into those tongues which, besides being in 
close touch with Engl., had also the possibility to render Engl. face by a word 
formally similar (Russ. sohranit’ lico already mentioned is of bookish origin). 
The diffusion has thus been possible in so far as there was no resistance to 
overcome. This may be clearly understood if one considers that the locution 
is completely unknown in Span. usage. It may well be that, some day, extra- 
linguistic factors such as the prestige of an individual or a stronger interest of 
Spain in Far Eastern affairs might force to save one’s face into Span., but in 
any case the Span. linguistic system will offer a greater resistance than Fr. and 
Ital., either in rendering it by salvar la cara (which rules out the possibility of 
an acceptance based on formal similarity with Engl. and Fr. face) or in trans- 
lating it, much more improbably, by salvar la faz: as a matter of fact, Span. 
faz has a pronounced poetical or scientific connotation in comparison with the 
elegant and elated rostro or the common cara, the latter being used ‘‘en locu- 
ciones que no admiten, como equivalente, ninguna de las otras palabras’’ (J. J. 
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de Mora, Coleccién de sinén. de la lengua cast., Madrid 1855, p. 40, ep. R. Ruppert 
y Ujaravi, Span. Synonymik, Heidelberg 1940, p. 198-199; note e.g. the corre- 
spondences Ital. aver faccia Span. tener cara, Ital. senza faccia Span. sin cara, 
Ital. con che or qual faccia Span. con qué or cudl cara, Ital. cader a uno la faccia 
Span. caérsele a uno la cara, Ital. sfacciato Span. descarado, etc.). 

In areas outside Europe it will be worth while noting that Chin. lien*® has 
an exact counterpart in Jap. kao “face’”’ of kao wo tsubusu “to dishonour, to 
disgrace”’ lit. “to break face”, kao wo tateru “to put honour upon”’ lit. “‘to raise 
face’, etc. (F. Brinkley-F. Nanjé-Y. Iwasaki, Unabr. Jap.-Engl. Dict., Tokyd 
1896, p. 583 s.v. kao), but that also this use of kao with the meaning of “fame, 
reputation” is recent (J. M. L. Lemaréchal, Dict. jap.-frang., Tokyo 1904, p. 345 
s.v. kao “nouveau sens’, which might have originated during or have been 
diffused by the Sino-Jap. war of 1894-1895). 
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TRAITS OPPOSITIONNELS ET TRAITS CONTRASTIFS 
Luis J. Prieto 


I 


1. Dans ses Principes Troubetzkoy appelle “‘phonémes”’ les “‘unités phonologi- 
ques qui, au point de vue de la langue en question, ne se laissent pas analyser 
en unités phonologiques encore plus petites et successives.’’! La différence fonda- 
mentale entre cette définition du phonéme et celle du Projet du Cercle Linguisti- 
que de Prague, qui le définit comme une “unité phonologique non susceptible 
d’étre dissociée en unités phonologiques plus petites et plus simples.’”? réside dans 
la notion de ‘“‘non-successivité” des traits composant un phonéme qui figure 
dans la définition de Troubetzkoy. 

Le phonéme, en effet, est susceptible d’étre dissocié en unités phonologiques 
plus petites et plus simples, celles qu’on appelle les “traits pertinents,”’ et ceci 
rend inutilisable ce que donne le Projet relativement & la distinction entre les 
phonémes et les groupes de phonémes. 

Partant de la définition de Troubetzkoy, on remarque que le caractére ‘‘non 
successif’’ des traits composant un phonéme?’ permet bien de faire la distinction 
entre un phonéme et un groupe de phonémes, mais ne permet pas de distinguer 
entre un phonéme d’une part et un trait pertinent ou un groupe quelconque de 
traits pertinents non-successifs de l’autre. Un trait pertinent, lui aussi, est bien 
une unité non susceptible d’étre dissociée en unités plus petites et successives; 
il en va de méme pour certains groupes de traits, comme par exemple, dans 
beaucoup de langues, ‘“‘sonorité + occlusivité,”’ dont on sait qu’ils ne sont pas 
des “‘phonémes.” En outre nous verrons plus loin que la définition du phonéme 
qui figure dans les Principes est également applicable au prosodéme. 


2. Supposons des points de différentes couleurs qui, disposés en ligne droite, 
forment certaines combinaisons. Ces combinaisons sont pourvues d’une signifi- 
cation. Deux de ces combinaisons ne se différencient pas nécessairement par 
opposition de particularités chromatiques. En effet, une combinaison ‘point 
bleu—point rouge” différerait de ‘‘point rouge—point bleu” par une disposition 
différente sur la ligne droite des mémes particularités chromatiques, plutét que 
par l’opposition de ces particularités chromatiques. Mais entre deux points, la 
seule opposition possible est celle de particularités chromatiques. Toute opposi- 
tion fondée sur une disposition différente des particularités chromatiques de ces 


1N. 8. Troubetzkoy, Principes de phonologie traduits par J. Cantineau, p. 37. Paris: 
Klincksieck, 1949. 

* Projet de terminologie phonologique standardisée, TCLP 4.311. 

3 Ce qui a été mis en relief, peut-étre pour la premiére fois, par M. J. Vachek dans ses 
articles Phonemes and Phonological Units, TCLP 6.235-240 et Can the Phoneme be Defined 
in Terms of Time?, Mélanges . . . offerts a J. van Ginneken, pp. 101-104. Paris: Klincksieck, 
1937. 
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points en est forcément exclue. On peut dire que dans ce cas le facteur espace se 
réduit 4 zéro. 

Nos points étant disposés sur une ligne droite, les oppositions possibles fondées 
sur la place variable des particularités chromatiques (+ et O) sur cette ligne 
droite seraient les suivantes: 1) une succession +O s’oppose 4 une simultanéité 
de + et O comme @; + et O étant successifs, une succession +O s’oppose A 
une succession O +. Ceci nous donne un schéma d’oppositions: +O/®, +O/ 
O+. Par contre, l’unique opposition possible entre deux points serait: + chro- 
matiquement différent de O, c’est-a-dire +/O. 


3. Un phénoméne analogue se manifeste dans la langue, la ligne droite que 
nous avons supposée pour les points étant le temps.‘ S’il est possible de distin- 
guer deux mots tels que l’espagnol sé “‘je sais” et es “‘il est,’”’ ce n’est pas & cause 
de l’opposition des particularités phoniques, puisque |’ensemble des traits phoni- 
ques pertinents est le méme dans chaque mot, mais bien par le fait que ces 
traits phoniques sont disposés d’une fagon différente, c’est-A-dire, par le fait 
que /s/ et /e/ sont deux “‘points,’”’? deux moments, dont la distinction et l’ordre 
de succession dans le temps sont pertinents. 

Mais, en analysant la chaine parlée, on peut aboutir 4 des sections de cette 
chaine dont les composants ne manifestent pas d’oppositions de cette nature. 
Ces sections seront les “points” phoniques, ou, pour mieux dire, les moments 
ou le temps, quelle que soit la réalité physique, est réduit 4 zéro au point de vue 
fonctionnel. Ce sont les phonémes, comme les définissent les Principes. Quand 
on dit que les unités phonologiques composant un phonéme ne sont pas succes- 
sives, on veut dire qu’entre deux phonémes on ne peut établir qu’une opposition 
du type +/O, c’est a dire, que sont exclues dans ce cas les oppositions des 
types +O/® ou +O/O+. 


4. Nous avons vu que la définition du phonéme proposée par Troubetzkoy 
nous permet de faire la distinction entre un phonéme et un groupe de phonémes, 
mais elle n’est pas utilisable pour établir la distinction entre le phonéme et les 
unités plus petites qui le composent. Mais ceci n’est pas |’essentiel; ‘‘l’essentiel,”’ 
dit André Martinet® ‘est de donner une représentation [de la langue 4 décrire] 
qui rende pleine justice 4 tous les éléments différenciatifs”; et ceux-ci sont les 
traits pertinents et leur disposition dans le temps. L’analyse en termes de pho- 
n>mes est donc admissible pour autant qu’elle rend justice 4 toutes les oppositions 
possibles fondées sur la disposition dans le temps des particularités phoniques 
pertinentes. Une fois ceci acquis, ce sont des raisons d’ordre pratique qui décident 
de la forme définitive 4 donner 4 cette analyse. Par exemple, dans beaucoup de 
langues, il est phonologiquement indifférent qu’on considére le complexe phoni- 
que [b] cor-me formé d’un seul phonéme caractérisé par les traits ‘occlusion 


4 Le temps de la phonologie n’est certainement pas le temps concret. Ce qui y importe 
c’est 1) qu’il n’y ait qu’une seule dimension, 2) qu’il soit possible d’y établir un sens positif 
et un autre négatif. Cf. J. Vachek, Mélanges . . . van Ginneken. pp. 103-104. 

5 A. Martinet, Ov en est la phonologie?, Lingua 1.48. 
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labiale’”’ + ‘sonorité,’”’? ou comme un groupe de deux phonémes dont |’un est 
caractérisé par l’occlusion labiale (ce phonéme serait alors identifié comme /p/), 
et l'autre par la voix (ou sonorité). Des raisons d’ordre pratique feront choisir, 
selon les cas, l’une ou l’autre solution. En tout cas /p/ et /sonorité/ n’entrent 
dans aucun schéme oppositionel des types +O/@ et +O/O+. Par contre, dans 
une langue ot un complexe phonique [d4] est en opposition phonologique avec 
un autre [na], il faut reconnaitre dans l’un ou dans |’autre complexe ou dans 
Yun et l’autre, deux moments; c’est-a-dire, on est obligé de reconnaitre les 
phonémes /d/, /a/ et /na/, ou bien /da/, /n/ et /a/, ou bien /d/, /a/, /n/ et 
/a/, parce que, si l’on ne fait pas un de ces découpages, c’est-a-dire, si l’on réduit 
le temps de [da] et [na] 4 zéro, il ne serait pas possible d’expliquer ce qui distingue 
ces complexes. 

La définition du phonéme proposée par Troubetzkoy est donc valable, mais 
avec cette remarque: il faut distinguer entre les complexes phoniques ot |’on 
ne peut pas découvrir d’éléments successifs du point de vue fonctionnel, et ceux 
ou cela est possible. Tenant compte de cette limitation, tout ce qui est compris 
entre le “complexe maximum’’ 4 temps zéro et le trait pertinent a le droit d’étre 
appelé “‘phonéme.”’ Dans la description d’une langue le choix, parmi les com- 
plexes phoniques qui ont le droit d’étre considérés comme phonémes, de ceux 
qui seront, en définitive, admis comme les “‘phonémes’’ de la langue en question 
sera déterminé, non par des raisons fonctionnelles, mais bien par des considéra- 
tions purement pragmatiques.* Par conséquent, il est possible d’avoir plusieurs 
descriptions phonologiques d’une méme langue qui seraient toutes également 
bonnes ou, du moins, également justifiables. 


5. Quels que soient les critéres sur lesquels on se fonde pour classer comme 
‘“‘prosodiques”’ certains traits pertinents,’ il faut avant tout que les traits perti- 
nents ‘“‘prosodiques,”’ qui caractérisent, selon les cas, une syllabe ou une more, 
ne puissent étre décomposés en unités phonologiques plus petites et successives. 
En effet, dans un mot comme |’espagnol miro “je regarde,”’ si l’on tient compte 
seulement des traits “‘prosodiques”’ ‘‘intense” de la syllabe mz- et ‘‘non-intense”’ 
de -ro, il faut forcement distinguer deux “‘points,” c’est-4-dire, considérer |’in- 
tensité et le manque d’intensité comme successifs, et ceci parce qu’il existe, & 
cété de miro, un mot miré “il a regardé” qui se distingue du premier par une 
opposition du type +O/O+. Par contre, on ne trouve jamais 4 cété d’un mot 
monosyllabique comme |’espagnol sé ‘‘oui,” un autre mot tel que, pour expliquer 
ce qui le distingue de st, il serait nécessaire de considérer dans |’intensité de 
celui-ci plus d’un “point.” 

Le ‘‘prosodéme’’® est donc lui aussi une unité phonologique 4 temps zéro et, 


6 Voir pourtant ci-dessous, §18. 

7 Nous essayerons d’exposer ci-dessous les raisons qui exigent un traitement séparé des 
traits prosodiques et des traits non-prosodiques. Elles n’apparaissent pas clairement dans 
les Principes. Voir ci-dessous, §18. 

8 Dans les Principes, pp. 212-213, le ‘‘prosodéme”’ est défini comme “‘la plus petite unité 
prosodique de la langue en question, autrement dit, la syllabe dans les langues qui comptent 
les syllabes et les mores dans les langues qui comptent les mores.’’? Pour Troubetzkoy, donc, 
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par conséquent, il entre aussi dans la définition du “‘phonéme” proposée par 
Troubetzkoy. 


6. Enfin, il y a des cas ov |’on pourrait interpréter l’opposition de deux com- 
plexes phoniques comme une opposition du type +O/@® ou +O/O-+ aussi bien 
que comme une opposition du type +/O. 

Nous avons considéré dans le paragraphe précédent l’opposition entre miro 
et miré comme une opposition du type +O/O-+. On y considére donc, comme 
pertinentes, l’intensité et le manque d’intensité, et l’on identifie l’intensité de la 
premiére syllabe de miro avec |’intensité de la seconde syllabe de miré, et le 
manque d’intensité de la seconde syllabe de miro avec le manque d’intensité de 
la premiére syllabe de miré. Mais il serait possible aussi de considérer comme 
pertinentes non l’intensité et son absence, mais la variation de l’intensité, qui est 
descendante dans miro et ascendante dans miré. I] y aurait en ce cas une opposi- 
tion de “variation d’intensité,” opposant l’intensité montante et l’intensité 
descendante, lesquelles, bien qu’elles impliquent le temp: pour se réaliser, seraient 
& temps zéro au point de vue fonctionnel. L’opposition de miro et miré corres- 
pondrait ainsi au type +/O. 

De méme les sons géminés peuvent étre considérés comme formant avec les 
sons simples correspondants une opposition du type ++/+; ce qui n’est qu’un 
cas particulier du schéma +O/@. Cette interprétation comporte l’identification 
du son simple avec chaque élément entrant dans la composition du son géminé. 
Mais on pourrait considérer comme pertinente la gémination elle-méme, qui 
serait alors une espéce d’intensité; de cette facon l’oppc ition “son géminé—son 
simple” appartiendrait au type +/O. 

Dans la plupart des cas, la premiére interprétation serait pratiquement 
préférable. Mais dans la “variation tonique’’ qui permet de discerner deux 
accents ou davantage, ainsi que dans la gémination des voyelles, la seconde 
interprétation pourra parfois s’imposer; elle sera d’ailleurs seule en harmonie 
avec l’explication d’autres faits.® 





le ‘“‘prosodéme’”’ semble étre, non l’ensemble de traits ‘“‘prosodiques”’ qui caractérisent la 
syllabe ou la more, mais la syllabe ou la more elles-mémes. Un mot comme |’espagnol miro 
a donc, d’aprés Troubetzkoy, les prosodémes mi- et -ro, l’intensité qui caractérise m7- et le 
manque d’intensité qui caractérise -ro demeurant sans désignation propre. A notre avis, 
pourtant, il est plus pratique d’appliquer le nom de ‘‘prosodéme”’ 4 l’ensemble des traits 
“‘prosodiques”’ qui caractérisent une syllabe ou une more. Dans ce qui suit nous lui don- 
nerons cette signification. D’ailleurs les résultats seront & peu prés les mémes: miro aura 
toujours deux ‘‘prosodémes,”’ qui seront les prosodémes /intense/ de mi- et /non-intense/ 
de -ro. 

® Cette premiére partie de mon exposition m’a été suggérée surtout par l’article fonda- 
mental d’André Martinet, Un ou deux phonémes?, Acta Linguistica 1.94-103. Je me permets 
pourtant de faire une observation 4 propos de cet article. On y base la solution du probléme 
“Un ou deux phonémes?”’ sur la possibilité de procéder & la commutation des sons ou des 
articulations qui conditionnent les groupes de sons (p. 95) ; or, deux sons successifs représen- 
teront avec certitude deux phonémes s’il sont tous deux commutables (p. 96). A notre avis, 
pourtant, s’il n’existe dans un groupe de sons successifs, aucune raison qui exige de con- 
sidérer leur “‘successivité’’ comme fonctionnelle, et non comme purement phonétique, il 
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II 


7. Nous avons caractérisé le phonéme comme |’unité phonologique 4 temps 
zéro. Nous allons voir maintenant si les phonémes (et aussi les ‘‘prosodémes’’) 
sont susceptibles, en se succédant dans le temps, de former des unités plus 
grandes. 

Nous savons que, d’aprés les Principes, on appelle “unité phonologique” 
chaque terme d’une opposition phonique distinctive:!° de l’opposition tausend— 
Tischler, il ressort que -ausend et -ischler sont des “unités phonologiques.” Il 
est évident que, dans ce sens, toutes les langues ont des unités phonologiques 
formées par la succession des phonémes dans le temps. Mais nous voulons donner 
a l’“unité phonologique” une signification plus étroite. Ce qui nous intéresse ici 
c’est de déterminer si, dans une langue donnée, les phonémes forment des groupes 
tels qu’il soit nécessaire de les considérer comme un tout au point de vue fonction- 
nel, de sorte que les phonémes n’auraient pas d’existence propre en dehors de ces 
groupes." 

Or, puisque le phonéme n’existe pas en dehors de la phrase,” ce qui nous per- 
mettra de découvrir l’existence d’une unité formée par la combinaison de pho- 
némes successifs, sera le fait que des groupes de phonémes ayant une forme 
phonique caractéristique remplissent dans la phrase un réle particulier, de sorte 
qu’on ne peut pas considérer la phrase comme composée d’un ou de plusieurs 
phonémes, mais comme composée d’un ou de plusieurs de ces groupes. Dans une 





n’est pas nécessaire d’y voir deux phonémes, méme si les composants du groupe en question 
sont commutables indépendamment les uns des autres. Soit, par exemple, une langue od 
il n’y a pas d’autres groupes de consonnes qu’occlusive plus r, ces groupes pouvant occuper 
les positions qu’occupent les consonnes simples. I] n’est pas nécessaire de considérer ces 
groupes, comme composés de deux phonémes, du moment que les oppositions telles que, par 
exemple, /pr/—/rp/ y sont exclues. On peut donc considérer ces groupes comme mono- 
phonématique, et la possibilité de commuter ses composants indépendamment les uns des 
autres ne serait pas différente de la possibilité qui existe dans /b/, par exemple, de com- 
muter la localisation ‘“‘labiale”’ indépendamment de la “‘sonorité’”’ (/b/—/d/—/p/—/t/). 
Certes, on peut au méme titre considérer /b/ comme biphonématique, c’est 4 dire, comme 
/p/ plus /sonorité/. Dans cette langue hypothétique, en effet, tout ce qu’on peut dire de 
/br/—/dr/—/b/—/d/ serait applicable & /b/—/d/—/p/—/t/ et inversement. I] nous parait 
donc que la possibilité de commuter une particularité phonique prouve seulement la perti- 
nence phonologique de cette particularité phonique (certainement, en tant que cette com- 
mutation implique un changement dans le plan du contenu). La n&écessiTé& (non la Pos- 
SIBILIT£) de considérer dans un groupe de sons successifs deux phonémes ou davantage 
n’existe qu’en tant que la ‘‘successivité’’ de ces sons est fonctionnelle, c’est 4 dire, en tant 
qu’elle est en opposition phonologique avec la ‘‘non-successivité’’ ou la ‘‘successivité dans 
l’ordre inverse”’ des mémes sons. I] semble d’ailleurs que Martinet, dans ses travaux ul- 
térieurs, suggétre, dans une certaine mesure, la méme fagon de voir. Cf., par exemple, 
Lingua 1.48, & propos de %. 

10P. 36. 

11 On est tenté de dire que les phonémes forment les ‘‘mots’’ dans le sens courant, et 
qu’ils n’existent pas ea dehors des mots. Mais le mot n’est pas ici une “‘unité phonologique”’; 
il n’est pas une unité du systéme phonologique, mais reléve de |’usage qui est fait, dans un 
autre domaine, des ressources phonologiques. Voir, ci-dessous, §20. 

12 Nous employons le mot “‘phrase’”’ dans le sens de ‘‘ce qui peut réellement étre dit.” 
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langue ot la phrase pourrait consister phoniquement en ‘‘un ou plusieurs pho- 
némes quelconques,”’ il n’y aurait pas lieu de considérer une autre unité que le 
phonéme, parce qu’on n’y pourrait pas signaler de groupes de phonémes jouant 
un role particulier. La, au contraire, ot l’on peut constater |’existence de groupes 
de phonémes tels que la phrase serait toujours formée d’un ou de plusieurs de ces 
groupes, le phonéme serait 4 considérer non comme |’élément composart directe- 
ment la phrase, mais comme faisant partie de ces groupes, qui deviendraient 
ainsi des réalités phonologiques en tant que groupes, c’est 4 dire, qu’ils entre- 
raient dans la catégorie des ‘‘unités phonologiques” au sens étroit du mot. 

De plus, il est possible qu’aprés avoir déterminé les unités dont se compose la 
phrase, on constate que ces unités ne se composent pas de phonémes, mais 
d’autres unités qui, 4 leur tour, sont décomposables en phonémes, et cela pour 
les mémes raisons qui nous ont amené & reconnaitre les unités composant la 
phrase. Théoriquement, la décomposition des unités peut se répéter plusieurs 
fois avant d’arriver aux phonémes. En fait, nous ne connaissons pas de langues 
ov il y ait plus de trois unités phonologiques au dessus du phonéme, 4 savoir la 
syllabe, la ‘‘racine’’® et le mot phonologiques. 

Quoiqu’il en soit, les unités formées par la combinaison de phonémes successifs 
se composent de plusieurs unités inférieures (qui peuvent étre des phonemes), 
et ce n’est qu’un cas particulier quand elles se composent d’une seule unité 
inférieure. En conséquence, nous les appellerons “‘unités (phonologiques) syntag- 
matiques,” les opposant ainsi aux phonémes, lesquels, en tant qu’appartenant & 
un paradigme de particularités phoniques distinctives, seront appelés ‘“unités 
(phonologiques) paradigmatiques.”’ 

Généralement la phonologie s’est occupée jusqu’a présent des unités paradig- 
matiques, laissant de cété la détermination et |’étude des unités syntagmatiques, 
bien qu’il existe, comme nous le verrons, une étroite relation entre les unes et 
les autres. 


8. Ce que permet le plus souvent de caractériser phoniquement une unité 
syntagmatique c’est l’existence d’un trait 4 “fonction culminative,” suivant la 
terminologie de Troubetzkoy. Le fait que certains groupes de phonémes contien- 
nent un seul de ces traits caractérise ces groupes du point de vue phonique. 
Mais |’élément essentiel, sans lequel le trait en question n’est pas culminant et 
les groupes de phonémes qu’il caractérise ne sont pas des unités syntagmatiques, 
e’est le rdle particulier joué par ces groupes, ce!** 7’étre l’unité qui entre dans la 
composition de la phrase ou d’une autre unité syntagmatique. 

En espagnol, ainsi qu’en tchéque, existent des groupes de phonémes carac- 
térisés par un et un seul phonéme ‘“‘intense’’;'* Mais en espagnol, toute phrase 
doit étre composée d’un ou de plusieurs de ces groupes. Par contre, en tchéque, 
ou il existe des mots comme lopata “‘pelle,’’'® qui n’ont aucun phonéme “‘intense,”’ 


13 Voir, ci-dessous, §11. 

1 Peu importe ici que l’intensité regoive en espagnol une réalisation tout différente de 
celle qu’elle regoit en tchéque. Cf. Principes, p. 208. 

18 Principes, p 213. 
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et peuvent cependant constituer 4 eux seuls des phrases, on ne découvre aucun 
role particulier 4 assigner aux groupes de phonémes caractérisés par un et un 
seul phonéme “intense.’’ Donc, en espagnol, le phonéme “‘intense’’ posséde une 
fonction culminative, et, par conséquent, la langue connait des unités syntag- 
matiques caractérisées par ce phonéme culminant, ce qui n’est pas le cas du 
tchéque. 

Dans bien des langues ce n’est pas un seul phonéme qui exerce la fonction 
culminative qui permet de caractériser une unité syntagmatique déterminée, 
mais deux ou davantage, de sorte que l’unité syntagmatique y est caractérisée 
par la présence d’un et d’un seul de ces phonémes. Un exemple bien connu est 
celui des langues qui ont deux ou plusieurs types d’accent culminant. Chacune 
des unités qui composent la phrase y est caractérisée par la présence d’un et d’un 
seul de ces accents. 

Dans ce qui suit, nous appellerons ‘“‘phonéme central” ou ‘“‘centre” d’une unité 
syntagmatique le phonéme culminant qui permet de la caractériser phonique- 
ment. 


9. Nous ne croyons pas qu’il existe de langue, comme celle que nous avons 
supposée ci-dessus, od la phrase se composerait directement de phonémes. Dans 
les langues qui possédent un ou plusieurs accents 4 fonction culminative, les 
unités entrant dans la composition de la phrase sont, comme nous venons de le 
voir, celles qui sont caractérisées par la présence d’un accent et d’un seul. Par 
contre, dans les langues qui n’ont pas d’accent 4 fonction culminative, le réle 
d’unité composant la phrase est joué par les unités caractérisées par la présence 
d’un et d’un seul des phonémes appelés “‘syllabiques,” lesquels exercent ainsi la 
fonction culminative. Nous avons vu que, dans les langues 4 accent culminatif, 
une phrase non minima (c’est 4 dir ‘ormée de plusieurs unités), est analysable 
en autant d’unités qu’elle contient d’accents, ces unités pouvant 4 leur tour 
étre des phrases. De méme, dans les langues qui n’ont pas d’accent culminatif, 
la phrase non minima est décomposable en un nombre d’unités égal au nombre 
des phonémes “syllabiques,”’ ces unités étant susceptibles du point de vue 
phonique——le seul qui nous intéresse ici'® de former des phrases a elles 
seules. 

Mais méme dans les langues qui ont un ou plusieurs types d’accent culminant, 
il existe des unités syntagmatiques caractérisées par la fonction culminative des 
phonémes ‘“‘syllabiques.”’ Il n’y a peut-étre pas une langue ov les unités qui 
entrent directement dans la composition de l’unité caractérisée par |’accent 
soient les phonémes. Si une telle langue existait, l’unité caractérisée par |’accent 
serait formée par “un ou plusieurs phonémes quelconques.” Le cas le plus 
fréquent est celui od l’unité caractérisée par l’accent est composée d’une ou de 
plusieurs unités caractérisées par la fonction culminative des phonémes ‘“‘syllabi- 
ques,” ces unités étant 4 leur tour composées de phonémes. Mais il y a des 
langues ov, 4 cété des unités caractérisées par l’accent et les unités caractérisées 





16 Voir ci-dessus §10 et surtout ci-dessus §20. 
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par les phonémes “‘syllabiques,” il y a un troisiéme type d’unité syntagmatique. 
Nous y reviendrons au §11. 

En tout cas, dans chacune des unités qui composent l’unité caractérisée par 
l’accent, celui-ci ne peut se trouver qu’en une seule position. A |’intérieur de ces 
unités on ne peut donc pas contraster deux points successifs au moyen de l’accent 
ou de son absence. Ainsi, on peut dire que, quelle que soit la réalité phonétique, 
l’accent ou le manque d’accent sont associés 4 ces unités tout entiéres, et non & 
un des éléments qui les composent (par exemple, 4 un phonéme). 


10. On admet partout que, en tant que l’accent a une fonction culminative, 
les phrases telles que l’espagnol él mismo “lui-méme” et el mismo “le méme’’” 
se distinguent, non par l’opposition des phonémes “‘intense’’ et ‘‘non intense” 
associés a él et el, ou par l’opposition des phonémes /é/ et /e/, mais par le fait 
que él mismo a deux “unités,” tandis que el mismo n’en a qu’une seule. Par 
contre, on admet en méme temps que deux phrases comme |’espagnol veo “je 
vois” et ven “‘ils voient,’’ ‘viens’ se distinguent par l’opposition des phonémes 
/o/ et /n/ et non par le fait que ven n’a qu’une unité, tandis que veo en a deux. 
Nous croyons cependant que la distinction entre él mismo et el mismo et celle 
entre veo et ven sont du méme ordre. 

Ce qui complique les choses c’est la relation du plan de |’expression avec le 
plan du contenu. II existe lA aussi une fonction culminative exercée par certains 
éléments, de sorte que chaque phrase sur ce plan est ou bien formée d’une 
“unité,” ou bien elle est analysable en autant unités qu’il y a d’éléments 
culminants, ces unités (les ‘‘mots’”) étant également capables de former des 
phrases. 

Ces unités du contenu correspondent dans bien des langues, sur le plan de 
l’expression, aux unités syntagmatiques qui composent la phrase. En ce cas, les 
unités qui composent la phrase sur le plan de |’expression, et qui peuvent, du 
point de vue phonique, étre des phrases a elles seules, peuvent |’étre aussi du 
point de vue du contenu. Tel est le cas de l’espagnol ot, 4 chaque unité syntag- 
matique caractérisée par |’accent, correspond toujours une des unités qui com- 
posent la phrase sur le plan du contenu, et inversement. Nous donnons, a4 des 
unités syntagmatiques qui composent la phrase sur le plan de |’expression, et qui 
correspondent & des unités du plan du contenu qui y composent aussi la phrase, 
le nom de “mot phonologique.”” Une unité caractérisée par |’accent semble 
toujours étre un mot phonologique. En outre, les langues ot tous les “mots” 
sont monosyllabiques doivent nécessairement avoir le “mot phonologique.” 

Il y a des langues ov les unités syntagmatiques qui composent le mot ont 
toujours un signifié, c’est 4 dire, qu’elles correspondent toujours 4 des unités du 
contenu. Ce sont les langues 4 accent secondaire pertinent que nous discuterons 
dans le paragraphe qui suit. 


17 D’autres exemples: él vino ‘‘il est venu,” el vino “‘le vin’’; équién es? ‘‘qui est-il?’’; 
équiénes? ‘‘qui?’”’ (pluriel). Dans ce dernier exemple, il peut y avoir, outre la différence 
d’accentuation ([kjén és]—[kjénes]), une différence dans la courbe mélodique interrogative, 
mais en tout cas cette différence serait conditionnée par la différence d’accentuation. 
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Dans les langues comme |’espagnol qui ont un mot phonologique caractérisé 
par l’accent culminant et qui n’ont pas d’accent secondaire, les unités composant 
le mot, c’est & dire, celles qui sont caractérisées par la fonction eulminative des 
phonémes “‘syllabiques,” n’ont un signifié que par exception. Ii n’y a donc pour 
ces unités aucune correspondance entre le plan du contenu et celui de |’ex- 
pression. 

Enfin, il y a des langues ov 1’on ne peut signaler aucun parallélisme entre les 
deux plans. Ce sont les langues qui n’ont pas d’accent culminant; leurs unités 
composant la phrase sur le plan du contenu, correspondent 4 une ou plusieurs 
des unités qui composent la phrase sur le plan de |’expression, c’est-a-dire, & 
une ou plusieurs unités caractérisées par la fonction culminative des phonémes 
syllabiques. 

Les unités syntagmatiques caractérisées par les phonémes syllabiques sont 
toujours composées de phonémes et jamais d’autres unités syntagmatiques. 
Elles sont, par conséquent, les unités syntagmatiques les plus petites. Nous les 
appellerons ‘‘syllabes phonologiques.”’ 

Dans bien des langues la syllabe est l’unique unité syntagmatique; le mot 
phonologique n’existe pas. Dans les langues comme le birman," ov tous les mots 
sont monosyllabiques, le mot phonologique et la syllabe phonologique coincident. 

En ce qui concerne la distinction entre él mismo et el mismo et entre veo et 
ven, On pourra nous objecter que ce qui fait la différence entre |’analyse de la 
phrase en mots et celle du mot en syllabes, c’est que la premiére se fait paralléle- 
ment sur les plans du contenu et l’expression, de sorte qu’elle nous fournit des 
unités qui peuvent étre elles aussi des phrases, sur un plan aussi bien que sur 
autre, alors que ceci n’est pas le cas pour l’analyse du mot en syllabes. Lorsqu’on 
parle de la fonction culminative, on entend en général le cas ov les unités syntag- 
matiques caractérisées par le phonéme culminant en question correspondent 
toujours & des unités de contenu. Ainsi, par exemple, lorsque Martinet écrit: 
“the permanent function of accent is what Trubetzkoy has called culmination. 
It shows how many full words the text is composed of,’’® Ici full words ne peut 
désigner autre chose qu’une unité de contenu. 

Mais ce recours au plan du contenu ne nous parait pas nécessaire. Qu’un 
“groupe de sons caractérisé par la présence d’un et d’un seul son déterminé” 
soit une figure phonique, ceci est évident. En tant que cette figure joue un réle 
particulier dans le systéme phonologique, c’est-a-dire, dans le plan de |’expression 
d’une langue déterminée, elle a une existence phonologique, elle est une “unité 
syntagmatique.”’ 

Or ces figures, suivant la langue, ou sont toujours des “‘signes,”’ ou elles peuvent 
étre ou ne pas étre des “‘signes,” ou elles ne sont jamais des “‘signes.’’ Mais ceci 
n’est pas strictement un fait d’expression; ceci dépend plutét de l’usage que le 
contenu fait de l’expression”® et, par conséquent, ceci ne change rien 4 |’interpré- 
tation du systéme phonologique lui-méme. 


18 D’aprés Troubetzkoy, Principes, p. 264. 
18 Phonology as Functional Phonetics, p. 12. London, 1949. C’est nous qui soulignons. 
2° Voir ci-dessus §20. 
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11. On sait que dans les langues qui ont un accent “secondaire,” comme 
l’allemand, seuls les mots composés peuvent se distinguer par la position de 
Vaccent principal. Les mots composés ont toujours, dans ces langues, outre 
accent principal, un ou plusieurs accents secondaires. L’opposition entre deux 
mots par la position de l’accent s’établit toujours par le fait que l’un a |’accent 
principal a l’endroit od l’autre a un accent secondaire et inversement. Ainsi 
s’opposent, par exemple, allemand iibersetzen [berzécen] “traduire,” et tibersetzen 
[yberzécen] ‘‘passer de |’autre cété.”” Par contre, une opposition de cette nature 
ne peut pas s’établir entre un mot “simple’’ et un autre mot quel qu’il soit. Par 
exemple, setzen ‘‘mettre’’ ne peut jamais s’opposer 4 un autre mot par la position 
de l’accent. On doit donc admettre que, méme quand la voyelle porte |’accent 
principal, elle a aussi, du point de vue fonctionnel, un accent secondaire coinci- 
dant avec celui-la, de sorte que l’accent principal se trouve toujours superposé a 
un accent secondaire. Or l’accent principal peut occuper dans le mot autant de 
positions qu’il y a d’accents secondaires. 

Il en résulte que, dans ces langues, les unités syntagmatiques qui composent 
le mot ne sont pas les syllabes, mais les unités caractérisées phoniquement par 
la présence d’un et d’un seul accent principal ou secondaire. Nous appellerons 
ces unités “racine phonologique.” Un mot est ou composé d’une racine ou 
analysable en deux ou plusieurs racines qui peuvent 4 elles seules constituer des 
mots, c’est-a-dire, recevoir l’accent principal. La racine, 4 son tour, est décom- 
posable en syllabes. 

Puisque l’accent principal ne peut occuper dans la racine qu’une seule position, 
on peut appliquer 4 la racine ce que nous avons dit de la syllabe au §9 in fine. 
L’accent principal est associé & une racine tout entiére. Les racines sont ainsi ou 
“‘accentuées” (avec l’accent principal) ou “‘non-accentuées’’, comme c’est le cas 
des syllabes dans les langues qui n’ont d’autres unités syntagmatiques que la 
syllabe et le mot. 


Ill 


12. Nous croyons avoir démontré |’existence des unités syntagmatiques. 
Nous allons maintenant essayer d’en tirer des conséquences. 

Un corollaire évident de la fonction culminative qui est dévolue aux phonémes 
centraux, est que LE PHONEME CENTRAL ET LES PHONEMES NON-CENTRAUX 
D’UNE UNITE SYNTAGMATIQUE PEUVENT COMMUTER DANS CETTE UNITE UNIQUE- 
MENT AVEC UN PHONEME DU MfME TYPE (c’est-A-dire, le phonéme central seu- 
lement avec un autre phonéme central, et les phonémes non-centraux avec 
d’autres phonémes non-centraux). 

Prenons par exemple la syllabe esp. el. Son phonéme central /e/ peut com- 
muter avec un autre phonéme central, /a/, /i/ etc.; le résultat sera toujours une 
syllabe. De méme, /l/ peut commuter avec un autre phonéme non-central, 
/n/, /r/ ete., ou aussi avec zéro. 

Or, si nous opposons deux syllases, le phonéme central de l’un peut commuter 
seulement avec le phonéme central de l’autre, et ceci vaut aussi pour les phonémes 
non-centraux placés avant ou aprés le centre. Par exemple, en opposant esp. 
el et si, /e/ peut commuter avec /i/, et inversement. On en a la preuve dans le 





eS. 


1eS 
UX 


u- 
Tec 


ine 
ral, 


ter 
nes 
Sp. 
3 le 





TRAITS OPPOSITIONNELS ET TRAITS CONTRASTIFS 53 


fait que il et se sont bien possibles, comme syllabes, en espagnol. En ce qui 
concerne les phonémes non-centraux, on peut remplacer, par exemple, le zéro 
qui se trouve avant le centre de el, par le /s/ de si et inversement, parce que sel 
et 7 sont possibles comme syllabes. Par contre, en commutant, par exemple, le 
phonéme central de el avec le phonéme non-central /s/ de si, et inversement, 
on obtient sl et ei (phonét. [ei]), qui certainement ne sont pas des syllabes. 

Par conséquent, la distinction de deux syllabes résulte de l’opposition de 
leurs phonémes centraux d’une part et celle de leurs phonémes non-centraux de 
autre. El et si se distinguent donc par les oppositions suivantes: 


zéro—/s/ 
/e/—/i/ 
/\/—zéro 


I] va de méme pour le phonéme central et les phonémes non centraux qui 
constituent le mot, c’est 4 dire, pour |’accent ou les accents et le manque d’accent. 
Dans un mot, l’accent se substitue 4 un autre accent, mais non au manque d’ac- 
cent, et inversement. Et puisque |’accent et le manque d’accent sont associés & 
des syllabes,”' on peut dire que la syllabe accentuée d’un mot peut commuter 
avec une autre syllabe ayant une forme phonématique quelconque, 4 condition 
que cette syllabe soit accentuée. Le méme chose vaut pour les syllabes non- 
accentuées. 

La distinction entre deux mots, comme par exemple esp. veo et ven se réduit 
ainsi aux oppositions suivantes: 





syllabe centrale* /b/—/b/ 
/e/—/e/ 

zéro—/n/ 
syllabe non-centrale /o/—zéro 


Autre exemple: asa ‘‘anse” et asta “hampe”’: 





syllabe centrale /a/—/a/ 
zéro—/s/ 

syllabe non-centrale /s/—/t/ 
/a/—/a/ 


Feliz ‘“‘heureux’’ et alto “‘haut’’: 








syllabe non-centrale /fe/—zéro 
/\/—zéro 

syllabe centrale /i/—/a/ 
/0/—/\/ 

syllabe non-centrale zéro—/to/ 





21 Voir ci-dessus §9. 
*< C’est-d-dire, associée au phonéme central du mot. 
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Miro et miré (qui sont des mots opposés par la “‘position de l’accent’’): 








syllabe non-centrale zéro—/mi/ 

syllabe centrale /ta/—/r/ 
/i/—/o/ 

syllabe non-centrale /ro/—zéro 


Enfin, on peut appliquer ce raisonnement aux langues ayant un accent secon- 
daire et, par conséquent, une racine phonologique. Deux mots, comme all. 
tibersetzen “traduire” et iibersetzen ‘“‘passer de l’autre cété’”’ se distinguent donc 
de cette facgon: 


racine non centrale /yber/—zéro 





syllabe centrale /z/—zéro 
racine centrale /e/—/y/ 





/¢e/—/b/ 
syllabe non centrale /e/—/e/ 


/n/—/r/ 





racine non centrale zéro—/zecen/ 





13. LES PARTICULARITES PHONIQUES QUI DISTINGUENT LES PHONEMES CEN- 
TRAUX DES PHONEMES NON-CENTRAUX NE SONT DONC JAMAIS LES TERMES D’OP- 
POSITIONS DISTINCTIVES. Méme dans les cas déj4 mentionnés comme ceux de la 
distinction entre veo et ven et entre él mismo et el mismo on est obligé de recon- 
naitre que l’opposition ne s’établit pas entre /o/ et /n/, ou entre /é/ et /e/, 
mais bien entre une unité phonologique et zéro phonique. Cela signifie que la 
distinction se fonde sur le fait que veo et él mismo ont chacun deux unités, tandis 
que ven et el mismo n’en ont qu’une. 

Par conséquent, les particularités phoniques qui distinguent les phonémes 
centraux des phonémes non-centraux appartiennent seulement au syntagme; 
elles ne forment pas de paradigmes d’oppositions phoniques distinctives. SEULES 
LES PARTICILARITES PHONIQUES QUi DISTINGUENT LES PHONEMES CENTRAUX 
OU LES PHONEMES NON-CENTRAUX D’UN MEME TYPE D’UNITES SYNTAGMATIQUES 
SONT LES TERMES D’OPPOSITIONS DISTINCTIVES ET, PAR CONSEQUENT, CES PAR- 
TICULARITHS SEULES FORMENT LES PARADIGMES D’OPPOSITIONS PHONIQUES 
DISTINCTIVES. 

Nous appellerons les particularités phoniques qui distinguent les phonémes 
centraux des phonémes non-centraux “traits pertinents CONTRASTIFS,’’* et les 
particularités phoniques qui distinguent entre eux les phonémes centraux d’une 
part, les phonémes non-centraux d’autre part, “traits pertinents OPPOSITION- 
NELS.”’ Les traits oppositionnels forment la définition paradigmatique du pho- 


23 On peut appeler aussi les traits contrastifs ‘‘traits constitutifs’’ en tant qu’ils consti- 
tuent les unités syntagmatiques. Les traits oppositionnels, par contre, opposent ces unités. 
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néme, les traits contrastifs sa définition syntagmatique. La distinction entre 
traits oppositionnels et la distinction entre traits contrastifs s’établissent sur 
des axes différents. En termes saussuriens, les traits oppositionnels seraient en 
rapport associatif, les traits contrastifs en rapport syntagmatique. 


14. Dans ses Principes Troubetzkoy nous dit qu’au point de vue phonologique, 
l’essentiel dans |’accentuation culminative est seulement “la mise en relief 
générale du prosodéme culminant .. . tandis que les moyens par lesquels cette 
mise en relief est obtenue appartiennent au domaine de la phonétique.” Cette 
appartenance & la phonétique pourrait s’étendre, 4 bon droit, 4 tous les traits 
contrastifs. 

De plus, il n’est pas nécessaire que le méme emploi syntagmatique soit toujours 
caractérisé par les mémes traits contrastifs: je doute par exemple qu’il soit 
possible de trouver un trait commun & tous les phonémes “‘syllabiques” et un 
trait commun & tous les phonémes “non-syllabiques.” Le fait essentiel est que 
s'il existe deux phonémes d’emploi syntagmatique totalement différent, mais de 
définition paradigmatique identique, l’ensemble des traits oppositionnels com- 
muns est associé, dans chacun des phonémes en question, & des traits contrastifs 
différents. En outre, les traits oppositionnels peuvent, en méme temps jouer le 
réle de traits contrastifs. 

L’accent et le manque d’accent, dans les langues qui n’ont qu’un seul type 
d’accent, illustrent ce qu’on peut entendre par deux phonémes de définition 
oppositionelle identique et de nature contrastive différente: le contenu opposi- 
tionnel du phonéme central du mot, c’est 4 dire de l’accent, est, dans ces langues, 
nul, et il en va de méme du contenu oppositionnel du phonéme non-central du 
mot, c’est 4 dire, du manque d’accent. L’“‘intensité”’ et le ‘‘manque d’intensité”’ 
n’y sont que traits contrastifs. 

Dans les langues qui ont deux types d’accent ou davantage, ces accents ont 
chacun un contenu oppositionnel, au moyen duquel ils s’opposent entre eux; 
mais le manque d’accent a, dans ces langues aussi, le contenu oppositionnel nul. 


15. Le contenu oppositionnel d’un phonéme est donc déterminé par |’opposi- 
tion de ce phonéme aux autres phonémes ayant la méme fonction syntagmatique. 
En espagnol, par exemple, le contenu oppositionnel de /a/ est déterminé par son 
opposition aux autres phonémes centraux de syllabes, c’est 4 dire, /e/, /i/ etc.; 
de méme, le contenu oppositionnel de /b/ est déterminé par son opposition aux 
autres phonémes non-centraux de syllabe, c’est-A-dire, /d/, /g/, /p/, /t/ ete. 
Or le contenu oppositionnel du phonéme réalisé généralement comme [i] et celui 
du phonéme réalisé comme [}j] sont bien différents, puisque dans le contenu de /i/ 
sont inclus les particularités phoniques qui l’opposent 4 /a/, /e/ etc., tandis que 
le contenu de /j/ est formé par les particularités qui l’opposent 4 /b/, /p/, /d/ 
et autres phonémes non-centraux de syllabe. La ressemblance de /i/ et /j/ est 
done purement phonétique, c’est une ressemblance entre les réalisations, non 


“Pp, 214. 
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entre les phonémes; le méme cas se présente dans d’autres langues pour r “‘syllabi- 
que” et r “non syllabique” etc. Pourtant cette ressemblance phonétique n’est 
pas sans conséquences pour la phonologie; nous nous en occuperons en une autre 
occasion. 


16. Un phonéme central] est done pistinct d’un phonéme non-central mais 
non OPPOSE & lui. Ceci semble en contradiction avec |’affirmation de Troubetzkoy 
que la ‘‘faculté de s’opposer est la condition fondamentale de |’existence phono- 
logique en général.’’”> Mais Troubetzkoy ne manque pas d’ajouter qu’une unité 
phonologique peut exister seulement si elle s’oppose 4 toutes les unités phonologi- 
ques DU SYSTEME EN QUESTION,”® et c’est précisément |’existence de plusieurs 
systémes que nous admettons. En somme, ceci n’est pas trés loin des systémes 
partiels qui résultent de la neutralisation des oppositions distinctives. Les 
archiphonémes sont évidemment des entités phonologiques distinctes des pho- 
némes correspondants, mais ils ne sont pas opposés 4 ces phonémes. Si cela est 
possible, c’est-A-dire, si les phonimes et les archiphonémes peuvent étre dis- 
tincts sans étre opposés, ¢’est parce que les uns et les autres n’appartiennent pas, 
4 strictement parler, au méme systéme. 


17. Les phonémes centraux et non-centraux de la syllabe forment donc deux 
systémes. L’un est composé des phonémes qu’on appelle “vocaliques” et com- 
prend parfois aussi d’autres phonémes comme les ‘‘consonnes faisant syllabe”’; 
Vautre est composé des phonémes qu’on appelle ‘“consonantiques” auxquels 
s’ajoutent parfois les ‘“‘semivoyelles.”’ 


18. Quand les traits pertinents d’un complexe phonique de temps zéro ont 
des fonctions syntagmatiques différentes, il est nécessaire de distinguer dans ce 
complexe autant de phonémes simultanés qu’il y a de fonctions différentes. 
Soit, par exemple, le mot espagnol mesa “table.’”’ Nous distinguons dans le com- 
plexe [é], qui en fait partie, deux phonémes simultanés: |’un dont le contenu 
oppositionnel est l’ensemble des traits qui distinguent /e/ des autres phonémes 
centraux de syllabe; l’autre, de contenu oppositionnel nul, reconnaissable au 
trait contrastif “intense.” 

Les phonémes sont done a classer selon qu’ils sont centraux ou non-centraux 
dans chaque unité syntagmatique de la langue en question. Pour autant qu’il y a 
deux ou plusieurs phonémes ayant la méme fonction syntagmatique, ils s’op- 
posent entre eux par leur contenu oppositionnel et forment ainsi des paradigmes 
de particularités phoniques distinctives. 

Nous allons maintenant comparer cette interprétation des faits phoniques 
pertinents avec la division des traits pertinents en “prosodiques” et ‘‘non- 
prosodiques.”’ 

Martinet a déja insisté sur le fait que ce n’est pas la différence dans la substance 
phonique qui exige une distinction entre les traits ‘‘prosodiques’’ et “non pro- 


% Principes, p. 82. 
26 Thid. 
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sodiques’’: “if we indeed distinguish between two chapters called respectively 
‘phonematics’ and ‘prosody,’ we do this, not because the substance of phonemes 
and that of prosodical features is objectively different.’””” Nous éliminons donc la 
possibilité de fonder la distinction en question sur la substance phonique. Mais 
le fait qu’une particularité phonique caractérise non un phonéme, mais bien un 
groupe de phonémes (une syllabe), ne suffit pas non plus, 4 notre avis, pour 
justifier la distinction entre des traits ‘‘prosodiques” et ‘non-prosodiques.”’ 
Pour autant qu’il y a des traits pertinents pour les phonémes centraux de syllabe 
qui ne sont pas pertinents pour les phonémes non-centraux, ces traits pourraient 
étre considérés comme caractérisant non le phonéme, mais la syllabe entiére. 
Soit, par exemple, une langue hypothétique od la nasalité et le manque de 
nasalité seraient pertinents pour les phonémes centraux de syllabe, mais non 
pour les phonémes non-centraux. Du moment qu’on ne pourrait pas, dans ces 
conditions, découvrir dans la syllabe des ‘‘points’”’ successifs caractérisés par la 
nasalité ou le manque de nasalité, on pourrait dire que la nasalité caractérise 
non les voyelles, mais les syllabes, qui en ce cas seraient ‘“‘nasales’’ ou non- 
nasales.’’ Nous ne croyons pourtant pas qu’on gagnerait rien 4 faire, dans cette 
langue, une distinction entre la “‘nasalité’”’ et les autres traits pertinents des 
voyelles, puisque tous les faits fonctionnels, dans le cas supposé, restent aussi 
bien expliqués en considérant la nasalité comme un trait ‘“phonématique” qu’en 
la considérant comme un trait “‘prosodique.”’ Ce cas se présente réellement dans 
les langues ayant, 4 titre non-culminatif, |’intensité, la gémination des voyelles, 
la hauteur musicale ou la variation de ton. Peu importe si, en tchéque par 
exemple, on considére |’intensité comme faisant partie du contenu oppositionnel 
des phonémes centraux de syllabe ou bien comme un prosodéme. 

Mais tout ceci se présente tout autrement lorsque les particularités ci-dessus 
ou d’autres sont les traits culminants du mot. Alors ces particularités ont une 
fonction syntagmatique différente de la fonction des autres traits pertinents 
coexistant avec celles-ci, et ceci exige, comme nous |’avons vu, leur traitement 
séparé. 

Nous croyons que le terme ‘‘prosodique”’ pourrait, si on le désire, étre appliqué 
aux phonémes dont la fonction appartient 4 des unités syntagmatiques plus 
grandes que la syllabe, c’est-a-dire, aux traits pertinents qui caractérisent le 
mot ou la racine phonologique. Mais dans les langues ou la syllabe est l’unique 
unité syntagmatique, le terme ‘“‘prosodique” nous semble tout 4 fait superflu. 


IV 


19. Nous avons jusqu’ici laissé de cété les faits d’intonation de la phrase 
dont nous n’avons pas l’intention de traiter en détail. Signalons toutefois que, 
peut-étre dans toutes les langues, |’intonation détermine l’existence d’une autre 
unité syntagmatique. Pour autant qu’on peut découvrir dans une langue une ou 
plusieurs courbes mélodiques ‘‘complétes,”’ la phrase a une de ces courbes mélodi- 
ques completes, ou elle est analysable en deux ou plusieurs courbes mélodiques 


7 Phonology, p. 10. 
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completes.” Or, dire que la phrase se compose de mots, qui peuvent 4 leur tour 
étre des phrases ne correspond pas exactement & la réalité. La phrase esp. El 
nitio va a la escuela ‘“‘L’enfant va & l’école,”’ par exemple, est analysable en trois 
mots, el nifio, va et a la escuela, mais ces mots séparément ne peuvent pas étre 
des phrases sans plus; pour le devenir ils doivent recevoir une intonation com- 
plete. 

La phrase donc, 4 strictement parler, se compose de ces unités syntagmatiques 
caractérisées par l’intonation compléte que nous appellerons ‘“‘unités d’intona- 
tion”; le mot, 4 son tour, est l’unité qui entre dans la composition des unités 
d’intonation. 


20. Le systéme phonologique d’une langue se présente ainsi comme un TOUT 
ARTICULE. Seules les unités d’intonation ont une existence indépendante. L’unité 
d’intonation se compose de mots, le mot de syllabes ou racines, suivant les 
langues, et la racine de syllabes. S’il y a plus de formes possibles pour l’unité 
d’intonation que pour le mot, et pour le mot plus que pour la racine ou la syllabe 
etc., ce n’est qu’une conséquence du caractére articulé du systéme phonologique. 
Et c’est précisément ce systéme dans son ensemble qui constitue l’objet de la 
phonologie. On ne peut pas étudier indépendamment la ‘“‘phonologie de la phrase” 
d’une part et la “phonologie du mot” de l’autre. Dans bien des cas il est. pratique- 
ment préférable de traiter séparément les ‘‘phonémes’”’ et les “‘prosodémes,”’ et, 
dans les ‘‘prosodémes,” de séparer ceux qui distinguent les phrases et ceux qui 
distinguent les mots. Mais il faut en tout cas tenir compte de la relation qui les 
unit comme les plans différents d’un méme systéme articulé. 

Ce systéme est un systéme d’expression, et, par conséquent, ce sont ses possi- 
bilités d’expression qui présentent un intérét. Elles sont, naturellement, au 
service de l’‘‘exprimé,”’ qui est, dans le cas de la langue, le contenu. Mais L’ USAGE 
QUE FAIT LE CONTENU DES POSSIBILITES D’EXPRESSION QUI LUI SONT OFFERTES 
PAR LE SYSTEME PHONOLOGIQUE, TOUT EN ETANT UN FAIT DE “LANGUE,”’ N’ENTRE 
PAS A PROPREMENT PARLER, DANS LE SYSTEME PHONOLOGIQUE LUI-M£ME, ET 
PAR CONSEQUENT RESTE EN DEHORS DE L’OBJET DE LA PHONOLOGIE.”® Ceci nous 


* Naturellement, il ne s’agit ici que des faits d’intonation pertinents. C’est le critére 
fonctionnel qui nous permet déja de constater si une courbe mélodique est compléte ou non. 
A ce propos il faut remarquer qu’il n’est pas nécessaire qu’une langue posséde plusieurs 
courbes mélodiques complétes en opposition entre elles pour que les faits d’intonation 
soient pertirents. Il suffit seulement d’une courbe mélodique compléte pour que l’intonation 
soit pertinente, puisqu’ainsi il est possible de distinguer entre une phrase composée d’une 
unité et une autre composée de plusieurs unités. C’est un fait analogue & celui de l’accentua- 
tion culminative & un seul type d’accent. L’intonation unique ainsi cue l’accent unique 
caractérisent des unités syntagmatiques et permettent de distinguer entre une unité et un 
complexe de deux ou plusieurs unités. Par conséquent l’intonation et l’accent seraient en 
ce cas pertinents bien qu’ils ne soient pas en opposition avec une autre intonation ou avec 
un autre accent respectifs. 

29 Tl nous semble que ceci doit servir 4 décider définitivement de la place qui revient & 
la Kombinationslehre dans la phonologie. Cf. les objections faites par E. M. Uhlenbeck & 
Troubetzkoy dans Lingua 2.242-243. A ce qu’il semble, Uhlenbeck n’accepte pas, en dehors 
des phonémes, d’autres unités strictement phonologiques, c’est-A-dire, définissables sans 
recours au plan du contenu. 
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permet de dire avec certitude, une fois connu le systéme phonologique, si tel ou 
tel groupe de sons peut étre un mot ou non dans une langue, et cela indépendam- 
ment du fait qu’il l’est réellement ou non, c’est-4-dire, qu’il est “utilisé” ou non 
par le contenu.*° 

Nous touchons ainsi au probléme de la détermination de ce qui est impossible 
et de ce qui est possible mais non réalisé. Si ce probléme existe, c’est parce qu’on 
ne peut pas connaitre le systéme phonologique, sinon en tant qu’il est “utilisé” 
par le contenu et parce que certaines parties de ce systéme ne sont pas assez 
“ytilisées’’ pour nous permettre d’en connaitre le contour précis. 


Roma 286 
Cérdoba, Argentina 





8° De plus, ceci confirme ]’existence phonologique des unités syntagmatiques. 








ON THE DEVELOPMENT OF LATIN STOPS IN ARAGONESE 
Re 3ert L. Pouirzer 


The retention of Latin un. ied intervocalic stops in High Aragonese and 
some Gascon dialects has always been a favorite subject of investigation and 
discussion.' The reasons for the interest are twofold: the retention of Latin 
intervocalic stops in those dialects presents the only definite exception to their 
general voicing in all of France, Spain, and Northern Italy,? the sort of exception 
to a general rule that has always held a certain fascination for linguists; in 
addition there is the justified hope that the discussion of the exception may 
shed some light on the larger and more important process to which exception 
is being made. 

The retention of unvoiced stops occurs in the Pyrenees, in the territory ad- 
joining the Basque area. There seems to be fairly general agreement that there 
is some connection between the phenomenon and Basque. Just how the Basque 
influence made itself felt is quite problematical. 

The theory of Basque influence was formulated first by Saroihandy, and later 
fully accepted by Rohlfs in his well known monograph on Gascon.’ Both Saroi- 
handy and Rohlfs connect the retention of the unvoiced stops with the supposed 
presence of aspirate intervocalic unvoiced stops in ancient Basque. The Basque 
speakers, so they claim, pronounced Latin unvoiced intervocalic stops as as- 
pirates. The aspiration in turn prevented the voicing. 

Structural linguists have quite recently given a different interpretation to the 
whole problem. Juilland and Haudricourt, in their structural history of French 
phonology, discuss at great length the voicing of intervocalic stops in the Ro- 
mance world, and describe it as part of a structural chain reaction: Latin doubles 
simplify, single unvoiced become voiced, and voiced stops spirantize (i > t, 
t > d,d > 3). They also argue that the existence of intervocalic geminates is 
the prerequisite for this chain reaction. The voicing takes place only if a new 
unvoiced (derived from the simplification of the geminate) can take the place 
left vacant in the system by the voicing of the original unvoiced stop. From 
these theoretical premises they argue that the Basque language lacks geminates, 


1See for instance A. Kuhn, ‘‘Der hocharagonesische Dialekt,’? RLiR 11(1935).72ff; 
W. D. Elcock, De quelques affinités phonétiques entre l’aragonais et le béarnais, (Paris, 1938). 
For a brief summary of the problem, see W. von Wartburg, Die Ausgliederung der romani- 
schen Sprachrédume (Bern, 1950), 32-33. 

2 Mario A. Pei, ‘‘Intervocalic Occlusives in ‘East’ and ‘West’ Romance,’ Romanic 
Review 34(1943) .235-247, gives various other examples of retention of Latin unvoiced inter- 
vocalic stops in the Western Romance world. The discussion, however, deals with indivic.ual 
words rather than with linguistic structure. 

3 Jean Saraoihandy, ‘‘Vestiges de phonétique ibérienne en territoire romane,’’ Revue 
intérnationale des Etudes Basques, 7(1913).475-497. G. Rohlfs, Le Gascon, Beiheft zur ZRPh 
85 (1935) .83ff. 
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and that this lack of geminates spread into the Latin spoken adjacent to the 
Basque territory and prevented the voicing of the intervocalic stops.‘ 

Martinet, dealing indirectly with the same problem in a series of articles, 
favors a different interpretation. He too sees the voicing of intervocalic stops as 
part of a structural chain reaction. He is much more cautious in establishing the 
voicing as a generally possible phenomenon dependent only upon the existence 
of geminates. He rather intimates that the voicing may be connected with specific 
conditions present in Celtic and suggests rather tentatively a connection between 
the Romance voicing of intervocalic stops and a similar process in Celtic.5 As 
to the retention of the stops in Aragonese he suggests that it is due to the preser- 
vation of a pattern imposed upon Latin by Basque. For ancient Basque he re- 
constructs a pattern which did not oppose voiced stops to voiced continuants. 
Lenes appear initially as stops and intervocalically as continuants: a phonemic 
opposition /p/—/b/ appears intervocalically as [p]—[8]. This pattern accounts 
for the spirantization of the Latin intervocalic voiced stops in Aragonese without 
accompanying voicing in unvoiced stops. This is also the pattern which eventu- 
ally became established in many Spanish dialects including Castilian. There, 
however, it became generalized only after the voicing of Latin unvoiced stops 
had taken place. In the area adjacent to Basque the pattern was adopted before 
the general Western Romance voicing.*® 

Ronjat has argued for a different theory: with specific reference to the Gascon 
rather than the Aragonese side of the problem, he declares that the supposed 
retention of the Latin unvoiced stops is really their restoration, since they had 
originally gone through the general voicing process.’ In structural terms, Ron- 
jat’s argument could be phrased as follows: after the voicing, the Latin spoken 
close to Basque opposed, in intervocalic position, voiced stop and voiced con- 
tinuant. However, Basque speakers knew in their language only the opposition 
unvoiced stop-voiced continuant, and therefore replaced the Latin voiced stop 
by an unvoiced stop. Such a process does not seem completely unfeasible. At 
any rate the main argument advanced so far against Ronjat’s theory can be 
questioned: Rohlfs argues that if the unvoiced stops of Aragonese and Gascon 
were restored voiced stops, the original voiced stops of Latin would also have 
been affected by this restoration and have become unvoiced.’ This argument is 
unconvincing, for the original Latin voiced stops had spirantized, and would 
not have been affected by an unvoicing of voiced stops. 


4A.G. Juilland and A. G. Haudricourt, Essai pour une histoire structurale du phonétisme 
francais (Paris, 1949), 51-52. 

5 A. Martinet, ‘‘Celtic Lenition and Western Romance Consonants,’’ Language, 28(1952). 
192-218. 

* For Martinet’s argument see ‘‘De la sonorization des occlusives initiales en Basque,” 
Word, 6(1950) .224-234, and ‘“‘The Unvoicing of the Old Spanish Sibilants,’’ Romance Phi- 
lology, 5(1951-52) .133-156, esp. 146. 

7 Jules Ronjat, Grammaire-istorique des parlers provengauz modernes, Vol. 2 (Montpellier, 
1932), 73. 

8 Rohlfs, op. cit., 88. 
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Elcock devoted the larger part of an entire book to the problem under con- 
sideration. After a thorough examination of the entire evidence from the point 
of view of linguistic geography, he polemizes against the Saroihandy-Rohlfs 
explanation, casting doubt upon the existence of aspirated stops in ancient 
Basque. He also rejects the Ronjat theory, repeating the argument of Rohlfs 
which we have given above. Elcock finally professes ignorance as to the ultimate 
causes of the phenomenon.°® 

It is not easy to reach any conclusion about the various theories mentioned. 
Present-day knowledge of ancient Basque is quite uncertain, which makes it 
difficult to give definite proof for the Saroithandy-Rohlfs explanation. The 
structuralistic explanation of Juilland and Haudricourt does not give sufficient 
reason for the spirantization of voiced Latin stops in high Aragonese, nor does 
it prove the supposed early simplification of geminates in that dialect. Martinet’s 
explanation, on the other hand, appears weakened by the fact that geminates 
do simplify in Aragonese and Gascon. The retention of the unvoiced stops, as a 
matter of fact, seems to affect the middle link of the chain # > t,t > d,d >3d. 
This in turn gives credence to the Ronjat hypothesis of a restoration of the voiced 
to unvoiced. 

The earliest Aragonese documents do show the preservation of the Latin 
unvoiced stops.’° But the documents are of the 11th century. Since the general 
voicing of the intervocalic stops is, by some authors at least, assumed to have 
taken place as early as the 3rd century," the retention of unvoiced stops in the 
11tin century documents may not give us any clues as to the rocess responsible. 
Yet a short study of the quantitative relations of certain phenomena in the 
Spanish 11th-century documents will nevertheless help to elucidate the problem. 

Menéndez Pidal, in the latest edition of his Ortgenes, publishes a very in- 
teresting random sample of late Latin documents from Spain. The documerts, 
written from approximately 980 to 1090, are composed in an artificial, notarial 
Latin, interspersed with popular words showing Romanization, far beyond the 
generally still Latinized text. These documents are obviously not replicas of 
the spoken language of the period, but we can expect, as Menéndez-Pidal in his 
Origenes certainly does, that they somehow reflect actual linguistic usage.” 
The documents are arranged in geographic order from West to East and grouped 
into Leonese, Castilian and Aragonese. We shall consider the documents in the 
same groupings, and within each group we shall determine the relative frequency 
of the following phenomena: 1) voicing of intervocalic stops, 2) simplification of 


* Elcock, op. cit., 22, 128ff, and 183. 

10 R. Menéndez Pidal in his earlier edition of the Ortgenes felt that the retentions were 
“cultismos.’’ In the latest edition he seems to think that they represent real linguistic 
usage. See Menéndez Pidal, Origenes del espafiol, 2nd ed., Madrid, 1929, 257, and 3rd ed., 
Madrid, 1950, 250ff. 

11 Von Wartburg, op. cit., 32ff. 

12 For a discussion of the nature of the documents and the reflection of popular usage 
in them, see Menéndez Pidal, op. cit., 3rd ed., vii-viii. Since the documents are written in 
an artificial petrified form of Latin, they may, of course, reflect popular usages of periods 
preceding the one in which they were actually composed. 
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geminates, 3) the substitution of b for initial v, 4) the substitution of 6 for in- 
tervocalic v. 

The reasons for this choice of phenomena are the following: the voicing of 
intervocalic stops and the exception to it is the problem under discussion. The 
simplification of geminates is supposed to be directly connected with the voicing 
as part of the same structural process. The substitution of b for initial v is charac- 
teristic of a Basque pattern ignoring the opposition of voiced stop/voiced con- 
tinuant. The substitution of b for intervocalic v is added for the purpose of com- 
parison with the initial substitution. For the merger of intervocalic b and v is, 
unlike the merger of initial b and v, a general Romance phenomenon. The sub- 
stitution of intervocalic b for v should thus occur also in dialects which keep 
initial b and v quite separate. Of course we must hope for results from the gen- 
eral statistical picture rather than from isolated misspellings. We cannot expect 
any direct correspondence between the letters of the Spanish scribes and the 
phonemes of their language. Thus a scribe may use the double ss as a symbol 
indicating unvoiced s as opposed to voiced s long after the simplification of 
geminates occurred. In evaluating the simplification of geminates we must thus 
keep in mind that we are looking for general uncertainty regarding the presenta- 
tion of geminates of Latin, rather than for a phonemic interpretation of individual 
letters. As for the liquids and nasals, it was decided not to include them in our 
count, since the palatalization process which Latin ll and nn undergo in the 
various Spanish dialects is not directly comparable to the simplification of e.g. 
tt to t and also makes the interpretation of the orthographies in the documents 
even more difficult. 

The following table gives the tabulation of the frequency of the above-men- 
tioned phenomena in the documents. The first figure indicates the instances of 
correct usage according to the Classical Latin standard, the second figure the 
deviations from that standard with the approximate percentage of deviation in 
parentheses. I should also like to point out that in counting I have omitted proper 
names, unless they were of obvious Latin origin, and a few words the etymology 
of which I was unable to determine and a discussion of which would have added 
nothing to the problem here presented, while changing the percentage figure by 
very little. 


Leonese Castilian Aragonese 
Voicing of intervocalic stops 100:20 (17%) 84:3 (3%) 39:0 
Simplification 10:16 (62%) 10:1 (9%) 7:1 (12%) 
b for v- 76:1 (2%) 47:3 (6%) 0:24 (100%) 
b for -v- 19:17 (48%) 14:13 (48%) 0:6 (100%) 


Some examples follow (pages refer to the Origenes, line numbers to the line of 
the document on the page). 
Voicing of stops: 
In Leonese: nodicia (24.3), prado (29.11), plaguit (31.16). 
In Castilian: semdario (33.19—note that this could also be nt>nd after 
syncopation of the vowel: semitero> sentero> sendero). 
Simplification of geminates: 
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In Leonese: redat (30.10), apate (24.3), Boca (30.2). 
In Castilian: regresus (33.7). 
In Aragonese: bakas (43.26). 
b for v-: 
In Leonese: bacelare (23.2). 
In Castilian: billa (33.3), bassallos (36.17), Bado (33.27). 
In Aragonese: bineas, bolumtate (40.6), bostru (42.7). 
b for -v-: 
In Leonese: lebaron (25.2), abolas (30.25). 
In Castilian: transibit (36.6), partiberunt (36.5). 
In Aragonese: comparabit (41.30), bibo (43.15). 

The results of the tabulation show the following: there is evidently a correla- 
tion between the voicing of the intervocalic stops and the simplification of the 
geminates. Both reach their maximum in the west and spend themselves toward 
the east. The percentage of simplification in Aragonese, it is true, is somewhat 
higher than in Castilian. But actually we are faced in Castilian as well as in 
Aragonese with only one sporadic occurrence which seems to correlate well 
with the absence of voicing in Aragonese and its extreme scarceness in Castilian. 
The Basque pattern, showing the b/v confusion in initial position, is definitely 
well established in the Aragonese documents. In Leonese and Castilian we find 
only isolated words showing the b substitution. The pattern as such has not yet 
been affected. In all positions we find a 100% substitution in Aragonese: v as a 
separate phoneme has ceased to exist. In Castilian and Leonese, on the other 
hand, there is the approximate 50% substitution that one might expect. The 
two phonemes still alternate initially, but intervocalically the opposition is 
neutralized, and either letter is used with approximately the same frequency. 

The findings point to the following conclusion: 1) there is apparently a direct 
connection between the simplification of the geminates and the voicing of inter- 
vocalic stops as argued by the structuralists. 2) The retention of the intervocalic 
unvoiced stops in Aragonese is evidently not connected with an early simpli- 
fication of geminates. 3) The voicing of intervocalic stops and the simplification 
of geminates proceeded from West to East.'* Voicing never did reach the high 
Aragonese area where a Basque pattern imposed itself before the voicing of 
intervocalic stops occurred. The simplification of geminates in Aragonese must 
thus have been a comparatively late phenomenon, perhaps imported from the 
adjacent Castilian. Thus of all the opinions examined, the one offered by Marti- 
net appears the most plausible. 

Some further conclusions not directly connected with the problem under 
discussion, but of somewhat wider implication, seem to be the following. 1) The 
voicing of intervocalic stops in the Western Romance world appears to be a 


13 The gradation of the frequency of the voicing from West to East was of course also 
noticed by Menéndez Pidal. He also pointed out that the western part of Spain was the 
center of the Celtic substratum (op. cit., 256-257). The general distribution of voicing in 
nascent Spanish seems to furnish further support for Martinet’s suggestion of a possible 
connection with Celtic. 
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much later phenomenon than is suspected by many authorities. Certainly there 
seems to be no justification for assuming an early 3rd century Western Romance 
unity based on the treatment of Latin intervocalic stops. (Cf. footnote 11.) 
For even if the documents reflect a popular usage which antedates the actual 
time of their composition (cf. footnote 12), the differences in the treatment of 
intervocalic stops shown in the documents here examined must be characteristic 
of a period which post-dates the 3rd or 4th century quite considerably. 2) The 
quantitative correlation of phenomena observed in textual analysis appears to 
be a useful and in historical linguistics perhaps quite essential aid to the structural 
approach. 


Harvard University 








ON THE GENDER OF EARLY ENGLISH LOANWORDS IN WELSH 
Rospert A. FowKeEs 


In his thorough study of the English Element in Welsh (Cymmrodorion Record 
Series, No. X, London, 1923), T. H. Parry-Williams designated a number of 
Welsh nouns as borrowings from Old English, and, with one or two exceptions, 
his designations may be accepted as valid. The words include: abad m. ‘abbot’, 
bad m. ‘boat’, berman m. ‘barm’, betws, bettws ‘chapel; secluded spot’ (in place 
names: Bettws-y-Coed, Betws Garmon), bwrdd m. ‘table’, camp f. ‘feat’, capan 
m. ‘cap, cape’, carl m. ‘churl’, casul m. ‘chasuble, cloak’, cist f. ‘chest’, cnul m. 
‘knell’ (a competing form clul may be the result of assimilation), cusan m. and 
f. ‘kiss’, crupl, crupil m. ‘cripple’, cwpan m. and f. ‘cup’, dwst m. ‘dust, powder’, 
edling, edlin m. ‘heir apparent’, ffald f. ‘fold’, ffordd f. ‘road, way’, ffwrdd m. 
‘road, way’, ffyrling (earlier ffyrlling, ffyrddling) f. ‘farthing’, grwnt m. ‘ground, 
foundation’, gwalstod m. ‘interpreter’, hebog m. ‘hawk, falcon’, het f. ‘hat’, hosan 
f. ‘hose, stocking’, llidiart f. ‘gate’, lloc m. ‘pen, fold’, llyffethair, llyffethar f. 
‘fetter, shackle’ (‘long fetter’?), offrwm m. ‘offering’, pabi m. ‘poppy’, prés m. 
‘brass’, punt f. ‘pound’, rhaca m. ‘rake’, rhawd f. ‘road, course’, sidan m. ‘silk’, 
suran f. ‘sorrel’, tarian f. ‘shield, buckler, target’, tunnell f. ‘ton’ (also ‘tun’), 
turn m. ‘turn, lathe’, wermod f. ‘wormwood’, wila m. ‘outlaw’, ysgadan pl. ‘her- 
rings’, yslén f. ‘jug, ewer, milking pail’ (Parry-Williams, 22-47). 

Of these, abad could possibly be from Latin (as J. Loth thought, Les mots 
latins dans les langues brittoniques, Paris, 1892, p. 129, but with the qualifying 
remark that its form indicates a date subsequent to the Roman-British period), 
although the expected form of a Latin borrowing would be *abawd, cf. H. Lewis, 
Yr Elfen Ladin yn yr Iaith Gymraeg, Cardiff, 1943, p. 6, 32; Lewis calls it a 
learned borrowing; the currently appearing dictionary Geiriadur Prifysgol Cymru 
(Cardiff, 1950—-) cites (p. 1) both OE and Latin as possible sources. The word 
carl may be directly from Norse (whence it was taken into OE, in any case). 
And turn could conceivably be of a later date. That several of the OE words 
are themselves borrowings (from Latin, Norse, etc.) has no immediate bearing 
upon the present subject. 

As frequently happens with borrowings, these words show a certain amount 
of variance from their sources with respect to gender. The purpose of this in- 
vestigation is to see whether reasons can be found for that variance. Transition 
from a language like OE, with a three-gender system, to one like Welsh, with a 
two-gender system, can account for the changes only in part. All Brythonic 
languages have reduced the Celtic three-gender system to one of two genders, 
as, indeed, has Goidelic also, although OIr. had m., f. and n. No neuter gender 
is directly attested in the nouns in any historical period of Welsh, Cornish or 
Breton. Yet there is definite evidence for a Brythonic neuter in those Latin words 
transmitted by Old Brythonic to Irish, a process which occurred early enough 
for Latin neuters to retain their gender in OIr., ef., e.g., OIr. ér n. ‘gold’ (from 
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the pre-Welsh equivalent of aur, which, in turn, is from Lat. aurum). Thus, 
Brythonic seems to have retained a neuter gender at least until the date at which 
such words were taken into Irish. Welsh aur is now masculine (as was Middle 
Welsh eur), cf. Pedersen, Vergl. Gramm. d. kelt. Spr. 2.66-67. If, however, the 
loss of the neuter were the sole factor involved in the changes, we might expect 
to find merely a redistribution of those words which were neuter in OE, some 
becoming masculine, the others feminine, or all being assigned to one gender. 
But this did not occur. Some OE masculines become feminine in Welsh, and 
some feminines become masculine. It remains to examine the pattern of gender 
in Welsh itself to see how it may account for the variation, as well as to ascertain, 
when possible, the reason for retention of original gender in those instances 
where there was no change. 

In Welsh, certain vowels have come to be associated with specific genders of 
noun and adjective, especially (but not exclusively) when alternating in pairs. 
This situation has been determined by historical phonological development. 
As is well-known, Celtic short 7 and u underwent a lowering to e and o before 
a final a, a process which is conspicuously reflected in Welsh today. (It is seen 
in old Breton and occurs sporadically in Cornish, but Middle and New Breton 
have lost the distinction through generalizing e and 0). Thus, to Celtic m. *yindos, 
f. *yinda ‘white’ there correspond Welsh m. gwyn, f. gwen, and for Celtic m. 
*dubnos, {. *dubna ‘deep’, Welsh has m. dwfn, f. dofn. Consequently, a consider- 
able number of feminine nouns and adjectives are characterized by e, 0, where 
corresponding masculines show y, w (reflexes of unaltered 7, uw), e.g.: cryf: cref 
‘strong’, byr: ber ‘short’, crwn: cron ‘round’, trwm: trom ‘heavy’, hogyn ‘boy’: 
hogen ‘girl’. Similarly, drws ‘door’ is masculine and dor, with the same meaning, 
is feminine (historically a neuter, as shown by OIr. dorus n.). These vowels have 
therefore come to be regarded as morphological characteristics or signs of gender 
(even though @ was not entirely monopolized by feminines). Loth has already 
demonstrated the importance of this for Latin loanwords in Brythonic (Les 
mots latins, 220-21). It is not surprising, then, to find that in many of the OE 
nouns borrowed in Welsh the gender assumed by them depends on the nature of 
their vocalism. The following, having w, are masculine: betws (OE bed-his n.), 
bwa (OE boga m.), bwrch (OE burh f.; Breton bourc’h f. is a loanword from French 
bourg, the latter being itself, of course, from Germanic), grwnt (from OE grund 
m.), dwst (OE dust n.), offrwm (OE offrung f.). The word cwpan is m. in South 
Welsh but generally f. in the North. In the Welsh Bible it is always m., and there 
are indications that m. is the earlier gender (cf. D. Silvan Evans, Geiriadur Cym- 
raeg, Carmarthen, 1887-96, p. 964). It is from an oblique case of OE cuppe, weak 
f. (Middle Welsh cwpa ‘cup; pint measure’ is m.). The presence of a vowel o 
results in feminine gender in Welsh for ffordd ‘road’ and bord ‘table’ (from OE 
ford m. and bord n.); bwrdd and ffwrdd, which imply a different chronological 
level of borrowing, although they seem to be attested at equally early date in 
Welsh documents (see Parry-Williams 35), are, as would be expected, masculine. 

Similarly, the association of the vowel e with the feminine gender, is responsi- 
ble for the fact that these words are f. in Welsh: crefft (OE creft m.), het (OE 
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hzt m.), tunnell (but conceivably from OFr. tonnelle, despite OE tunne? Latin 
words in Welsh with the termination -ell, regardless of original gender, likewise 
become feminine, cf. cangell f. ‘chancel’, from Lat. cancellus m.; cyllell f. ‘knife’, 
from Lat. cultellus m. Welsh -ell in inherited words of Celtic origin represents 
*4lla), wermod (with both e and 0, so that the likelihood of feminine gender is 
doubled, as it were; OE werméd is m.). It is possible that ystén belongs here, 
despite its long vowel; there are feminines in Welsh with final é from e, with 
secondary lengthening, cf. gwén f. ‘smile’, and ystén may have become associated 
with such nouns; it is from OE stzna, oblique case of stan. 

The OE loanwords in Welsh show, as one would expect, retention of masculine 
gender in nouns designating males, hence abad m. (from abbad, abbud, abbat 
m.; on ambiguity of provenience see above), carl (from OE carl, m., unless 
directly from Norse, see above), edling, edlin m. (from xdeling m.), gwalstod 
m. (from wealhstéd m.), wila m. (from dtlag, ditlah m.), probably hebog m. (despite 
the vowels e and 0; from OE heafoc m.). The word crupl, crupil (from crypel 
m.) may also belong in this category. For some reason there are no nouns in the 
list that designate females. 

For most of the remaining words no such general principle emerges to explain 
their gender. One can account for the masculine gender of cnul ‘knell’ (from OE 
cnyll m.) by noting that monosyllabic nouns in -ul seem to be masculine in 
Welsh without exception (bul ‘boll’, dul ‘bang, thump, thud’, hul ‘cover’, mul 
‘mule’ [from Latin], Sul ‘Sunday’ [from (dies) Sédlis]). It is, to be sure, not known 
how many of these words were in the Welsh vocabulary at the time cnul was 
introduced. Sul assuredly was, however (cf. H. Lewis 8, 47) and some such 
word as this may have set the pattern; there could have been other participating 
words, now lost. In some of the other loanwords, rhyme may have played a 
decisive réle. For rhawd f. there are several rhyming words of feminine gender: 
bawd, brawd ‘judgment’ (brawd ‘brother’ is, of course, m.), gwawd, etc., but there 
are also some masculines with the same vocalism, the most troublesome being, 
at first glance, the homonym rhawd ‘rout, crowd, troop’, a later loan, which is 
m. But the very existence of this homonymy may have worked toward keeping 
separate genders for the two words which would otherwise be completely alike. 
Similarly, bad, which is m., rhymes with a number of masculines, although a few 
feminines do occur to spoil the symmetry, and the same holds true for lloc and 
prés. The few words in Welsh which rhyme with pabi are, like it, masculine. 
On the other hand, cist, which is feminine, seems to have no feminine rhyming 
counterparts (dist, ffrist, sist, etc. are m.); cisi retains, without obvious reason, 
the gender of the OE original. 

Words in -an show fluctuation of gender for a very good reason. There is a 
Welsh suffix -an which is of common gender; hence it is not surprising to find 
loanwords with this ending (whether they are from oblique cases of weak nouns 
or from infinitives) being assigned to either gender, e.g., suran f., capan m. 
In fact, in the words cusan and cwpan gender fluctuates in Modern Welsh. 
The loanwords have been cited, for the most part, in their present-day forms, 
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with occasional reference to medieval predecessors. This apparently anachronistic 
procedure, which is employed by most works on the subject of loanwords in 
Celtic, can be defended on various grounds. In the first place, we have no way 
of determining what the form was in which these words were originally taken into 
Welsh nor their gender in Welsh at that time, in view of the absence of docu- 
ments of sufficiently early date. Secondly, it is perfectly legitimate to study 
modern counterparts of early loans and to investigate the factors, historical 
and otherwise, which have caused retention or alteration of gender. It should 
not be assumed that these OE words, when transmitted to Early Welsh, were 
immediately assigned to the gender they now have, although some no doubt 
were. Rather, there has been a process of adjustment and adaptation to certain 
patterns which have been evolved, even for ‘inherited’ Celtic words. That process 
is still in operation, as is witnessed by the variation in gender of some of the 
words examined above, not to mention the treatment of English borrowings 
occurring at the present time (in which, incidentally, the situation is almost the 
reverse of that noticed in OE loanwords, because now the direction of borrowing 
is from a system of virtually no gender to one of two genders). It is also certainly 
no accident that in Modern English loanwords in Welsh the masculines over- 
whelmingly outnumber the feminines. And there is some connection between 
this and the fact that masculines are more numerous than feminines in the 
Welsh vocabulary as a whole. This entails avoidance of much of the operation 
of lenition and indicates a ‘preference’ (not consciously, perhaps) for the easier 
category. 

Borrowing, it goes without saying, implies contact of languages and some 
degree of bilingualism. Learned borrowings, being literate (and often literary), 
possibly guarantee a greater likelihood of retention of original gender, provided 
the system of the receiving language can accommodate that gender. Yet even a 
word like offrwm ‘offering’, a learned loan from ecclesiastical vocabulary, was 
unable to assert its feminine gender against the pressure of the phonologically 
conditioned pattern which assigned words with the vowel w to the masculine 
gender. How long it retained its original gender, if at all, is solely a matter of 
conjecture, but it seems likely that the individual first introducing it (and the 
assumption that it was the work of one individual may, admittedly, be a vast 
over-simplification) was aware of its gender in OE and would consequently treat 
it as femir‘ne in Welsh. 

It might well be asked what gender means in a language like Welsh, where, 
even at a very early period, nominal inflectional endings vanished. It means, 
essentially, that a feminine noun reacts in a certain way to the operation of the 
process of consonantal mutation, a masculine in another, with some overlapping. 
Feminine nouns undergo initial mutation (lenition) after the definite article and 
induce lenition of a following adjective; masculines are not affected by this 
process. Also, in a small number of instances there is a special feminine singular 
form of the adjective. Apart from this, the only identifying sign of gender is 
the use of specific forms of the pronoun to refer to masculine and feminine nouns. 


ye 
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Significantly enough, the w:o alternation occurs in the demonstrative pronoun: 
hwn m., hon f. It was, then, the manner of fitting a loanword into this pattern 
that determined its gender in Welsh, according to the principles and analogies 
discussed above. If there were cultural, mythological and other motivating 
factors governing the choice of gender of inanimate nouns, they fail to appear 
in the borrowings from Old English. 


New York University 
Columbia University 
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ADDENDA ET CORRIGENDA TO “OUTLINE OF THE 
MANDARIN PHONEMIC SYSTEM”! 


HELEN WoNnG 


In the previous article we proposed the following seven vowels for Mandarin: 
/i, i, 3, u, e, a, o/. Further study has convinced us that we can dispense with 
the /o/ by treating it as /eu/ since the [o”], which is usually followed by the 
non-syllabic [y] and is not infrequently pronounced without rounding, is in 
complementary disiribution with [e], [e], [6], and [9], all of which are allophonic 
variants of /e/. The statement of the distribution of the allophones of this 
phoneme is then as follows: 

The tense variant, [e], occurs only before the nonsyllabic [j]; a lax variant, 
[e], preceded by non-syllabic [jj in final position or followed by a consonant; a 
flat variant, [6], after non-syllabic [ii]; a grave variant, [o”] adjacent to non- 
syllabic [y]; and, lastly, the neutral [9] in all other positions. 

The occurrences of /e/ may be illustrated by such examples as: 

/lei*/ [lej*] ‘tired’; /t*ie'/ [t*ie] ‘to stick on, to paste’; /lien*/ [lien*] ‘face’; 
/iie*/ [?36*] ‘month’; /c*eu?/ [ts*o” u’] ‘silk’; /le/ [la] perfective particle; /men?/ 
[mon*] ‘door’; /kea'/ [ka,"] ‘to place’. 

As a result of these changes we can no longer consider [ej] a free variant of 
/e/ and would view it as the sequence /ei/. The sequences /ei/ and /eu/ then 
parallel the sequences /ai/ and /au/. Nothing, therefore, need be said of the 
neutralization between /u/ and /o/ as former /iu? ~ io?/ ‘oil’ would be analyzed 
as /ieu?/ and /tup! ~ top'/ ‘east’ as merely /tuy'/, and so forth. 

Thus we now have six vowels displaying still, however, three distinctive 
features: 


Plain Flat Plain Flat 
Diffuse i ii Fs u 
Compact e a 
Acute Grave 


In accordance with the above revisions we need the following re-statements 
concerning vowel clusters: 

1. Vowels in all two-vowel clusters are opposed to each other from the point 
of view of the compactness feature except /ea/ in which both members are 
compact. 

2. Among the diffuse vowels only /i/ and /u/ may occur as first or second ele- 
ments in a two-vowel cluster whereas /ii/ and /i/ occur only as the first member 
of such clusters. 

3. All compact vowels can be followed by another vowel. They may occur 
after the non-compact vowels that are admitted as initial members of a cluster. 

4. Three-vowel clusters always contain a compact vowel in medial position. 


1 Helen Wong, ‘‘Outline of the Mandarin Phonemic System,’’ Word 9.268-76 (1953). 
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The first element of a three-vowel cluster is opposed to the last element in its 
flat—plain and/or grave—acute features—if the cluster contains only one 
compact vowel. That is, if the first element is /i/ then the last is /u/ and vice 
versa, except in /iai/. 

The following is the revised table representing the vowel sequences in Man- 
darin: 


i ii Fs u € a ai au ei eu ea 
i x x x x x 
a x z 
Fy x x x x x 
u x x x x 
€ = x x 
4 x x 


With regard to /1/, to clarify we note that the post-vocalic variant written 
formerly as [r] could be specifically described as a retroflexed vocoid, the phonetic 
quality of which depends on its preceding vowel. For example, /al‘/ [aa‘] ‘two’ 
and /el/ [99] diminutive suffix. 

Lastly, the diffuse acute nasal, /n/, in initial positions before the diffuse acute 
vowels, /i/ and /ii/, should have been transcribed phonetically as the dental [n] 
instead of [fi] as it appears in the previous paper (p. 269). 


University of Michigan 
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Les langues du Monde, par un groupe de linguistes sous la direction de A. 
MEILLET et MarceL ConeNn. Nouvelle édition, xlii + 1294 pp. & 21 maps. 
Paris: Centre Nationale de la recherche scientifique, 1952. 

Calling this volume a new edition of the well-known book published in 1924 is 
somewhat misleading because most of it is either completely new or thoroughly 
revamped. Yet it is truly indicative of the conservative approach that prevailed 
when this new version was planned in 1938, two years after the death of Antoine 
Meillet. Meillet’s name appears on the title page of a book “qui veut rester 
conforme & la méthode et aux enseignements du maitre disparu.’”’ Dedicating the 
book to his memory would, in my view, have been far more commendable. All 
those who had the privilege of being Meillet’s students share a lasting reverence 
for the man and the scholar. But, while some of them would tend to freeze his 
message into an orthodoxy, others prefer the kind of reverence that incites the 
disciples to try to emulate the master rather than to linger indefinitely in his 
tracks. One may have to question the teachings of an intellectual leader so as to 
be the more faithful to his spirit. Would it be unfair to say that publishing in 
1952 a survey of the world’s languages along lines agreed upon by Meillet in the 
early twenties, and justifying this by reference to one’s devotion to the defunct 
master, is to have misunderstood Meillet’s unflinching intellectual honesty for 
hieratic immobility? 

One will readily concede that the task Marcel Cohen set for himself in 1938 
was no easy one. Fourteen years before, few respectable linguists could have 
imagined a description of the world’s languages except as a collection of historical 
surveys of language families. Exceptions to a purely genetic classification could 
only be tolerated in those sections of the world where the progress of research had 
not yet revealed any but the most obvious kinship bonds. Such exceptions were 
only accepted as temporary deviations from the norm, likely to be eliminated in 
subsequent editions of the work. In the late thirties, however, the belief in the 
absolute preeminence of genetic classification was already shaken—today the 
time has come to remind some scholars that the study of language evolution is a 
legitimate chapter of linguistics. Cohen, at that time, decided against a deviation 
from the original layout whereby a synchronic characterization would have 
been given for every language, of the present or of the past, inasmuch of course 
as some description of the language was at hand or obtainable. Yet, in the course of 
the fourteen years it took to prepare the volume, no one could fail to register the 
expansion of descriptive linguistics. Some of the main contributors had obviously 
been influenced by functional and structural teachings, and eventually the char- 
acterizations of a few random “exotic” languages were included. Yet none of 
these sketches are really satisfactory, some because they are derived from un- 
reliable sources, others because they reveal the imperfect schooling of their 
authors, others still, although prepared by such a competent scholar as André 
Haudricourt, because they finally were “trimmed” by the editor. In a book of 
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such magnitude, basically a work of reference, a sampling is completely out of 
place; we want to be sure to find there a certain sort cf information for every 
language, and the description of some selected languages could only be accept- 
able if each one of these was meant as the illustration of a given type, and if 
every language quoted in the volume belonged to one of the types thus illus- 
trated. It is not the reviewer’s task to indicate how, in the present state of our 
science, a language cou!d be succinctly characterized as a structural unit, as the 
communication medium of a certain community, and as the product of an 
evolutionary process involving divergence and convergence. The international 
public is not likely to be satisfied with less than a repertory offering at least 
this minimum for every sufficiently well-known language. 

The volume under review contains many valuable contributions from excellent 
specialists, among them the chapter on Hamito-Semitic by Marcel Cohen him- 
self, but it is no such repertory. As a whole it will strike progressive linguists 
as somewhat amateurish and haphazard. With Cohen as an editor, we would have 
expected and welcomed more information in figures and percentages. Although 
Cohen is a convinced internationalist, and the roster of contributors presents not 
a few non-French scholars, the book is not free from unconscious ethnocentrism: 
the table of phonetic symbols contains a number of features practically unknown 
outside of France—and not particularly favored there except by Cohen and his 
students; on the map showing the distribution of cultural languages, Louisiana 
French is marked, but there is no trace of the numerically far more important 
Spanish speaking communities of the South-western United States; there is an 
obvious and laudable effort toward a critical selection of bibliographical items, 
but a few French works are cited which do not deserve to reach an international 
public. These are, on the whole, minor blemishes. More serious is the lack of 
homogeneity which defeats the very purposes of the book. 

Among the good features of the work, special mention should be made of the 
set of 21 maps, many of them in colors, placed in an envelope inserted between the 
last pages and the back cover. This does not mean that a linguist is not likely to 
detect at first glance a handful of errors or inaccuracies on the map or maps of 
his specialty: on the one presenting the IE languages of Europe, one could e.g. 
take exception to (1) the indication of Danish as the vernacular of the Faroe 
Islands, (2) the presence of a white (non-IE) spot between Hamburg and Bremen 
(is defunct Polabian meant, with the green of Slavic omitted?), (3) the boundaries 
of Franco-Provencal pushed too far to the North and not far enough to the 
South-east, (4) the whole of Cors:ca marked as belonging to the same are, (5) 
the assumption that German has been eradicated from Czechoslovakia, but pre- 
served in its pre-war expansion in today’s Poland and Russian East-Prussia. 
But one of the great advantages of these maps is that there can be no doubt they 
intend to refer to linguistic, and not to political or allegedly racial, realities. It 
was a good idea to distinguish between vernaculars and cultural languages. 
We have thus two Mercator’s projections of the world, one entitled ‘Langues de 
civilisation’, the other ‘Langues indigénes’. But a simple dichotomy does not 
suffice if one wants to give information concerning different periods: Algonquin 
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cannot be said to be the vernacular of New York City in the same way as Russian 
is the vernacular of Moscow. As regards the spread of cultural languages, a map 
would only make sense if we knew the exact uses to which each one of the lan- 
guages concerned is put in this or that area. An interesting criterion, in at least 
some sections of the world, might be the use of a language as the teaching medium 
in schools, as distinct from the language or languages taught there as subjects. 
But a complete picture could only be obtained through a number of different 
maps, each one based upon a different criterion. 

The volume is clearly printed on good paper, but the format is not well chosen: 
the book is too thick (close to 9 centimeters, 314”) in relation to its other dimen- 
sions (25 x 16 cmm.). These proportions would seem to put an undue strain on a 
cover which does not look too sturdy. Presented in the format used for this 
journal, on the same paper and with the same type, Les langues du monde would 
have been a much handier, 800-page volume. But one should probably allow for 
the natural bias of an editor. 


Columbia University ANDRE MARTINET 


W.F. Leopoip, Speech Development of a Bilingual Child: A Linguist’s Record. 
Vol I: Vocabulary Growth in the First Two Years; Vol. II: Sound-Learning in 
the First Two Years; Vol. III: Grammar and General Problems in the First 
Two Years; Vol. IV: Diary from Age 2. Northwestern University Studies, 
Humanity Series, Nos. 6, 11, 18, 19. Evanston, Illinois, 1939-1949. 

Of the three detailed linguistic studies of the speech of children known to this 
reviewer,! the present item is certainly both the most extensive and the linguis- 
tically most sophisticated. It is unfortunte that by the time Vols. II-IV were 
published after a wartime delay, Vol. I is now out of print, and the entire set 
can thus not be obtained together. Perhaps Northwestern University can be 
persuaded to reissue the first volume; certainly the scope and importance of the 
work merits it. 

The coverage of each volume appears pretty clearly from the subtitles, Vols. 
I-III each first present the evidence in systematic, rather than chronological, 
form, and then discuss it analytically from various viewpoints. Vol. IV is a 
separate entity, since it is an unanalyzed diary of data after the child’s second 
year, except for analytical comments contained in some of the diary entries. 
All four volumes are concerned primarily with the speech patterns of the author’s 
firstborn daughter Hildegard, dealing only annotationally with comparable de- 
velopments of his second daughter Karla. 

Leopold himself circumscribes the scope of his work as “definitely monographic 
in character. From wide reading in the enormous literature on child language 
I have come to the firm conviction that it is too early to attempt generalizations 
on this subject, at least from the linguistic point of view” (I. vii). Leopold has 


1 The other two are Antoine Grégoire, L’Apprentissage du langage, 2 vols. (Brussels, 
1937-47), and Karel Ohnesorg, O mluvnim viyvoji ditéte (Prague, 1948); id., Fonetické studie 
o détské ¥eti = Praéce z védeckiych uistavu LII (Prague, 1948). For Ohnesorg, see my review in 
Word 7.195-6 (1951). 
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here in his modesty underestimated the value of his contribution. Although—or 
perhaps because—he nowhere attempts any dogmatic statement of methodology 
or conclusions, but rather works with the analytical tools at hand (which, as in 
the case of Vol. III, sometimes may leave something to be desired), the very 
thoroughness of his investigation, and the dispassionate evaluation of his results 
by themselves serve to throw much-needed light upon a number of fundamental 
theoretical problems. 

For the general linguist not specifically interested in child language, Vols. 
II and III are the most suggestive. Vol. II contains, as part of a detailed analysis 
of the child’s sound system (138-206) and a discussion of the process of sound 
learning (257-266), the author’s reactions to Roman Jakobson’s theory of the 
acquisition sequence in the child’s learning of phonemic patterns.? Vol. III, 
especially in the sections on syntax (1-75) and general linguistic problems 
(102-153), contains much interesting evidence bearing on such problem areas of 
linguistic theory and pyschology of language as meaning, the nature of imitative 
learning, and the like. 

The child’s phonetic development is traced in great detail, and from a number 
of viewpoints, in the various sections of Vol II. Leopold here discusses, among 
others, the oft-mentioned but never clearly investigated problem of the pre- 
speech vocal phases of the child, and gives phonetic criteria for the crying, 
cooing, and babbling phases (161-162), also pointing at the considerable degree 
of overlap existing between various stages (crying and babbling, babbling and 
early speech, 141). In his description of the phonetic development of speech 
proper, Leopold astutely differentiates between phonemic and not-yet-phonemic 
use of sounds (195), and—as he does throughout all the volumes of his work— 
between permanent and temporary acquisitions. In his interpretative discussion, 
the author makes frequent reference to Jakobson’s theoretical framework 
which he considers ‘a decisive step forward” (274) and finds ‘Jakobson’s 
contrast theory confirmed in most of its phases” (204), although in a number of 
details ‘our findings disagree with Jakobson’s postulates” (ibid.). But the two 
basic theses of the Kindersprache, namely that limitations in children’s speech 
are “functional, not articulatory” (200) and that ‘the child began with coarse 
distinctions and progressed step by step to finer discriminations” (204), are fully 
supported by Leopold in his explicit statements, as well as implicit in his data. 

Whereas for a discussion of sound development, a more or less stabilized body 
of structural phonetic and phonemic theory and technique was available for 
Leopold to use, his discussion of morphemic problems in Vol. III suffers some- 
what from the lack of such a consolidated structural viewpoint in the field. 
Leopold, I think, chose the correct approach when he used a semi-traditionalist 
grammatical analysis in his description of this phase of the child’s learning 
process. In his concentration on the empirical observation of his daughter, and 
under the pressure of making his material available, he could undoubtedly ill 

? Roman Jakobson, Kindersprache, Aphasie und allgemeine Lautgesetze = Uppsala 


Universitets Arsskrift 1942: 9, Sprakvetenskapliga Sallskapets i Uppsala Férhandlingar, 
Jan. 1940-Dec. 1942, Bil. A. Uppsala, Almquist & Wicksell, 1942. 





n, 
rk 
1’s 


vO 
ch 
"se 
lly 


‘or 
1e- 
ld. 
ist 
ng 
nd 

ill 
ala 
ar, 





REVIEWS 77 


afford excessive methodological experimentation. Most of the more recent 
analytical procedures are based on purely formal criteria, but as Leopold says 
about syntax, ‘To understand the development of the child’s early sentence, we 
must probe into the hidden processes of the child’s mind, remaining conscious 
of the hypothetical nature of such inferences [italics mine]. Scholars who wish to 
restrict the domain of linguistics to the external form of language must refrain 
altogether from investigating children’s earliest syntax, because there is very 
little form to deal with” (1). Thus, much of the discussion in the Syntax of the 
First Two Years is essentially psychological, rather than formal-linguistic. Leo- 
pold attempts to discuss elementary one-word and two-word sentence patterns 
in terms of traditional meaning categories such as “statement,” ‘wish,’ etc., 
which find their expression in the child’s verbalization. This may appear aprioris- 
tic on the surface, but he bases his approximations of these categories on the 
child’s observed reactions to known stimuli. His inferences are in each case 
bolstered by an extensive observational record (as for his interpretation of “hoch” 
as an imperative, 9), and formal features of the linguistic pattern are considered 
in detail as they emerge (e.g., the child’s learning of “‘stress as a device used in 
the organization of syntactic patterns,” 32). Nowhere does Leopold explicitly 
state his exact technique for determining the child’s intent which he frequently 
infers from her imperfect attempts to communicate, but the assumption of 
parenteral emotional empathy with the child’s desires would not seem psycho- 
logically far-fetched and lends credence to his inferences. 

Leopold generalizes psychologically that the child progresses from emotional 
and volitional to intellectual speech (passim), and linguistically that the child 
progresses from simpler to more complex syntactic patters (subject-verb, verb- 
object, later subject-verb-object, 49). 

In his discussion of General Linguistic Problems of the First Two Years, Leo- 
pold touches upon some fundamental problems of the theory of language, and 
the psychological assumptions underlying it. He interprets his extensive obser- 
vations in a manner not conducive to some of the early behaviorist antimentalistic 
formulations, such as, for instance, sub-vocal speech. Describing play behavior 
with blocks of different color, he summarizes: “These categories of color had 
begun to develop, but their linguistic expression was in abeyance. She did not 
resort to emergency expressions. She used the clarifying principle of color in ac- 
tion, but did not try to express it in words. She had grasped it by means of words 
presented to her, but it existed in her detached from any active phonetic forms. 
Her behavior proved the reality of a concept, or set of concepts, which was not 
indissolubly linked with a movement of the organs of articulation” (106). In 
regard to another behavioristic assumption, namely that of the mechanically 
repetitive nature of imitation (which, as we know, has had some influence on 
linguistic language teaching procedures) it must not be forgotten that “imitation 
is by no means a passive process. The preceding section has shown that the 
imitation of language is anything but a mechanical problem, that it requires 
the most active cooperation of the child and the exercise of all the capacities 
which develop apace with it. More than that, the child remains the arbiter of 
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what to imitate. He makes a selection from the vast linguistic material offered 
by the environment and determines the order of its acquisition, often enough 
resisting efforts to teach specific items when such efforts are made” (156). It 
seems to follow from this and the preceding statements that language is a some- 
what more complex phenomenon psychologically than the substitute stimulus- 
response patterns of which we read in some of the ‘‘mechanistic’’ discussions. 

In the same part of Vol. III Leopold presents an extensive discussion of the 
child’s acquisition of word meaning, showing the gradual narrowing-down of 
originally rather extensive ranges of meaning. “The child subsumes experiences 
under standard words, the range of which has narrower boundaries in standard 
usage” (124), but ‘‘As the vocabulary grows, each item needs to embrace less 
and less semantic territory” (147). Many of the specific instances cited point 
to the gradual development of semantic contrast, thus the “striking special 
case’”’ of the “designation of logical opposites by the same word” (143). The child’s 
use of hot and heiss for both ‘hot’ and ‘cold,’ her use of up for both ‘up’ and 
‘down’ show a “two-phase discrimination . . .: normal and not normal” (144); the 
later separation of ‘hot’ and ‘cold,’ ‘up’ and ‘down’ is by Leopold called a “‘three- 
phase distinction” (ibid.); it could also be interpreted as a binary process in two 
steps: 

(1) normal vs. not-normal 
(2) not normal: hot vs. cold 
up vs. down 

Several comparable cases are discussed in this connection, and strengthen 
the interpretation in terms of contrastive development in terms of something 
like semantically relevant features. 

Problems of bilingualism are secondary to the general language learning proc- 
ess, because “Hildegard did not yet try to keep the two languages [English 
and German] apart, but build a hybrid system out of the elements of both” 
(179), which she sometimes used interchangeably. Structurally, “she adopted 
into her speech the features which the two languages have in common, and on the 
elementary level English and German are very similar’’ (186-187). The final sec- 
tions of Vol. III survey some of the problems contingent upon the child’s bilingual 
environment. 

Leopold’s study is of undoubted importance beyond the confines of linguistics 
in the narrow sense; this is also the author’s avowed intention, for “Linguists 
should not confine their attention to the mechanics of language, but cast a glance 
at the purposes which they serve and the manner in which they do so” (102). 
This clearly opens the door to adjacent disciplines, especially child psychology, 
with the literature of which Leopold has a remarkable familiarity. Throughout 
his four volumes, he makes frequent reference to the work of previous investi- 
gators, most of whom are in that field, and presents his material in a manner 
useful to non-linguists. His “Speech Development” should be standard on the 
reference shelves of all scholars dealing with genetic behavior problems, and 
certainly deserves a place in every linguist’s library. 

Institute of Languages and Linguistics Paut L. Garvin 

Georgetown University 
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UrnteL WernreIcu, Languages in Contact; Findings and Problems. With a Preface 
by André Martinet. xii + 148 pp. Publications of the Linguistic Circle of 
New York, Number 1. New York, 1953. 

The present book deals with linguistic aterference as the result of language 
contact. By interference the author understands “‘the rearrangement of patterns 
that results from the introduction of foreign elements into the more highly 
structured domains of language, such as the bulk of the phonemic system, a large 
part of the morphology and syntax, ‘and some areas of the vocabulary” (1.1). It 
attempts in particular “‘to show to what extent interference is determined by the 
structures of the two languages in contact’ (1.3). As “a full account of inter- 
ference in a language contact situation, including the diffusion, persistence, and 
evanescence of a particular interference phenomenon is possible only if the extra- 
linguistic factors are considered”’ (1.3), the author has, throughout the book, paid 
great attention to the socio-cultural setting of the linguistic problems. In the 
introductory pages and in the theoretical preliminaries (pp. 1-13), the author 
outlines his own approach and defines his basic terms. In the first section he 
examines the purely linguistic aspects of interference phenomena, whether or not 
they affect the languages as systems (pp. 14-70). In the second part the bilingual 
individual, the medium of all interference phenomena, ephemeral or persistent, 
is characterized (pp. 71-82); and in the final part the author gives a broad survey 
of all the extra-linguistic factors—psychological and sociological—which make 
out the bilingual’s background (pp. 83-115). A short appendix is devoted to the 
effect of bilingualism on the individual (pp. 116-122). No important aspect of 
the subject, which is a vast one, seems to have been left out, and the author 
deserves high praise not only for the unprecedented richness of the factual 
material he offers the reader, a material sifted and interpreted with both discern- 
ment and acumen, but also for having made use of all important contributions 
to the subject in the past. He has been able to present first hand material col- 
lected in the field by himself, as that illustrating the contact between Romansh 
and Schwyzertiitsch in the canton of Grisons (pp. 14-18, 84-86, and passim) 
and of Yiddish (passim). In a way this book may be said to offer on the subject 
in its general aspects what the volumes of Einar Haugen, conceived along similar 
lines and published at about the same time, give for one particular language 
(The Norwegian Language in America; a Study in Bilingual Behavior I-II, 
Philadelphia, 1953). Its value is heightened by the impressive bibliography of 658 
items, apparently as exhaustive as humanly possible on a subject whose limits 
are so difficult to draw. Very few books or papers published in any language, in 
America or Europe, seem to have escaped the author’s notice. 

The very richness of the contents makes it difficult for a reviewer to single out 
special instances for a more detailed discussion. It might satisfy the reviewer’s 
vanity to offer alternative solutions to specific problems, but it would hardly 
serve any useful purpose. Instead, this reviewer would like to draw the attention 
to some of the fundamental conceptions which pervade the whole treatment. 
The basic assumption is that the phenomena of interference, as defined by the 
author, imply the existence of bilingual individuals, living and speaking in specific 








80 REVIEWS 


social contexts. Curiously enough, this fact, which it should be superfluous to 
mention, was hardly taken into account in the older linguistic literature, where 
loans, borrowings, and mutual influences seem to occur in a social vacuum. 
The bilinguals may, in some cases, comprise nearly all the members of the com- 
munity. This is apparently true of many national minorities in the U.S. In other 
cases, they may form a small minority in their own community, but, given certain 
social conditions, their habits of speech, affected by their bilingualism, may spread 
to the speech of the monolingual members of the community. This diffusion 
factor, which is particularly important for languages which have been in con- 
tinued contact over long periods, is perhaps less fully treated in the monograph. 
Equally important in the author’s attitude is his emphasis on language as system, 
i.e. an arrangement of patterns which can be discovered through the linguist’s 
interpretation of speech events. The antinomy of language as system (code) and 
language as speech, as conceived by de Saussure, plays a great part with the 
author. He rightly points out at the start that all parts of language are not 
equally highly structured, and that therefore structural interpretations of inter- 
fereace phenomena are less obvious in the field of vocabulary than in the field 
of the phonemic and morphological system. For an experienced linguist with all 
the data at his disposal it is relatively easy to outline the phonemic system in a 
way that others will accept, and the very idea of system or patterns is easy to 
grasp and is easy to make use of. The chapter on phonetic and phonemic inter- 
ferences is therefore probably the one which will raise the fewest problems. The 
instances given by the author are well analyzed and carry conviction in most 
cases, at least for this reviewer. The data are well classified, and the main types 
of interference, over- and under-differentiation, reinterpretation of distinctions, 
phone substitutions, to which the complicating possibilities of hypercorrection 
are added, stand out clearly. Some of these mechanisms cannot be understood 
at all from a purely phonetic point of view. Interferences in the less structured 
lexical field are so well known and the extra-linguistic factors play such a dom- 
inant part there that a general survey of all the types cannot be expected to open 
up very new perspectives for the linguistic theory of interference. 

Of particular interest to the linguist is the study of interferences which affect 
morphology and syntax, and it is in this chapter (pp. 29-46) that the most diffi- 
cult problems arise; for, when dealing with grammatical categories, their ex- 
pression and their content, the author must tackle the problem of meaning and 
the nature of semantic systems and take a stand on these problems. In the open- 
ing paragraph (2.31) the author quotes the contradictory views of such author- 
ities as Meillet and Schuchardt. The former stressed the impenetrability of 
morphological systems, whereas the latter seems to have seen no limits to their 
capacity of mutually influencing each other. The author makes it quite clear that 
this paradoxical lack of agreement springs from loose definition of the basic 
terms, so that these two eminent scholars quite often do not speak the same 
language or do not speak about the same things, in spite of the apparent identity 
of their terminology—a sad and frequent case in controversies of this kind! By 
splitting up the general problem into several more specific ones and by trying 
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to define clearly his own terms, the author is on many points able to reformulate 
the questions in such a way that they can be answered. He admits that the 
transfer of full grammatical paradigms, with their formant morphemes, from one 
language to another, has apparently never been recorded (2.36, p. 44), but he 
shows conclusively that the transfer of individual morphemes of all types is 
possible under certain favorable structural conditions, that obligatory distinc- 
tions have been abandoned and that morphological oppositions have been re- 
interpreted (ibid.). Very interesting is the observation that most often the 
grammatical pattern which uses relatively few and invariant morphemes in its 
paradigms—the most explicit pattern—is more likely to serve as the model for 
imitation than the reverse (2.31, p. 41). The author sticks to his data, and 
without indulging in sweeping generalizations, concludes that a simple state- 
ment of the form: “ “Morphologies can(not) be mixed’ is premature at the pres- 
ent state of our knowledge” (2.36, pp. 43). 

When the author points out that “fewer phenomena of interference are in- 
corporated in the language as code than occur in the speech of bilinguals’ (ibid.), 
he is no doubt right, but in practice this leads him to an attitude close to that of 
Schuchardt. Schuchardt was mainly interested in speech events, and the notion 
of linguistic systems or the use that can be made of it never seemed to appeal to 
him. Meillet on the other hand was primarily interested in languages as well 
integrated systems, as codes (to use the author’s terminology), and perhaps less 
so in individual cases of code-switching. In spite of his Saussurean emphasis on 
the distinction between “‘langue’’ and “parole,” the author is inclined to over- 
stress the importance of collecting data in the field, e.g. when he says that “for 
an analysis that can do justice to the complexity of linguistic facts, the data 
must be obtained, first and foremost, from the flowing speech of bilinguals in the 
natural setting of language contact; the usual sort of evidence, taken from rela- 
tively well established languages, cannot be a substitute” (end of same para- 
graph). I wonder whether an accumulation of new evidence “from the flowing 
speech of bilinguals” would take us beyond what the author has already made 
perfectly clear, that in speech everything can happen, and that there is no limit 
to the erratic code-switching that can occur in individual cases under specific 
conditions. It is important to collect such data and to classify them in order to 
understand the mechanisms that are at work, but it is equally important to 
investigate the selection of interference phenomena that affects in a more perma- 
nent way the languages as systems. Written texts in “well established lan- 
guages,’”’ covering centuries of historical development, present us with data of 
another kind, but nevertheless data of the highest importance. By sticking too 
closely to empiric facts collected in the field and by relying too much upon the 
minute description and interpretation of individual instances of interference, 
the author runs the risk of losing sight of those changes in codes which can only 
be observed where languages have been in continuous geographical contact over 
long periods. The problem of convergent development of genetically unrelated 
and structurally different languages is therefore hardly touched upon in the 
monograph, probably because the questions connected with the diffusion of 
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interference phenomena are less sharply put than those concerning bilingualism 
in the narrower sense of the word. The study of linguistic convergence cannot 
be made without the data the author gives us, but it needs more than these 
data, namely a general and consistent theory of language, of form and meaning. 
Since we have not as yet such a generally accepted theory, it is probably true 
that we cannot make unambiguous statements about the mutual influence of 
morphological systems. It would be unjust to blame the author for not raising 
all these problems or for not offering new solutions to them. He is rather to be 
praised for his reluctance to make easy generalizations. His description of lan- 
guage contact and of the mechanisms of bilingualism is a remarkable contribu- 
tion to a general theory of interference and convergent developments. It will 
supersede many of the older treatments of the subject, and it will undoubtedly 
be a source of inspiration for future work. 

The presentation is impeccable. There are extremely few misprints, none of 
any importance as far as the reviewer can see. Some slight errors have crept 
into the text: on p. 44, 1.19, read Amer. Norwegian jdgg ‘jug’ instead of ‘yoke,’ 
and p. 48, 1.18, read Swedish o- instead of om-, on-. To the admirable bibliog- 
raphy the reviewer would like to add a few items: an interesting thesis on the 
assimilation of English words in the Norwegian of Norway—Aasta Stene, Eng- 
lish Loan-words in Norwegian: a Study of Linguistic Borrowing in the Process, 
London-Oslo 1940—and a too little known study by Henri Frei, Monosylla- 
bisme et polysyllabisme dans les emprunts linguistiques, Tekyo, 1936, where the 
structural differences between Chinese and Japanese are studied as to how they 
assimilate, or resist, foreign elements in the vocabulary. 


University of Oslo Hans Voet 


MacMILLAN, ALEXANDER S. M.D., & Ketemen, G., M.D., “Radiography of 
the Supraglottic Speech Organs. A Survey.” Reprinted from the A[merican] 
Mledical] Al[ssociation] Archives of Otolaryngology, 55 (June 1952): 671-688. 
The foregoing article presents in condensed form the results of a study aided by 

a grant from the Rockefeller Foundation. It is divided into four parts, (1) His- 

torical (2) Personal technique (3) Comments and (4) Bibliography of 92 items, to 

which a mimeographed addenda of 10 titles has been added. The authors follow 
the history of X-ray investigation of the speech organs (the larynx being ex- 
cluded), from the first paper of Max Scheier in Berlin (1897), only 2 years after 

Réntgen’s epoch-making discovery, down to Delattre’s comparative work on 

spectrograms and radiograms (1951). The interest of the study lies both in its 

historical presentation which allows the reader to follow the development of the 
technique from researcher to researcher, and also in the stress placed upon the 
excellent work of pioneers like Eijkman, Gutzmann, Russell, Scheier, Poirot, 
whose importance the new generation of phoneticians perhaps tends to mini- 
mize. The chief obstacle met with in these early days was the poor definition of 
the soft palate on the X-ray plates and the masking effect of the teeth, especially 
with gold fillings. The lateral movements of the tongue, too, which are so im- 
portant in the shaping of vowel resonators, could only be traced through a 
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technique of markers showing when the tongue was convex and when concave. 
The various oils and pastes suitable to bring out into relief the velum, the uvula 
and the tongue are also discussed. The authors then proceed to explain their 
own technique and point out briefly what remains to be done in this field. Un- 
fortunately they are not phoneticians and their conclusions are disappointingly 
vague as to what can be expected of X-ray photographs in relation with articu- 
latory problems and the formant theory. But they should be congratulated on 
a fine and thorough historical exposé and a helpful bibliography, to which few 
items could be added, such as: Curry, R., X-Ray Motion Picture Studies of the 
Vocal Organs’, JASA XIX.4 (1947): 723 ff.; Portman, Prof., ‘Radiologie et 
phonation,” Rapports, 4e Congrés de la Société francaise de Phoniatrie, Paris, 
1936; Scheier, Dr. Max, “Application des rayons X & la physiologie de la voix 
et du langage,” La voix parlée et chantée VIII.95 (1897): 321-327. 


Université de Montréal J. P. Vinay 


Eucene A. Nina, Learning a Foreign Language, A Handbook for Missionaries. 
ix + 237 pp. New York: Committee on Missionary Personnel of the Foreign 
Missions Conference of North America, 1950 (Cloth bound $2.00, paper $1.50). 
A book written by a competent scientific linguist (and missionary) with the 

aim of helping missionaries learn languages cannot be without importance for 
linguistic scholars. On the contrary, it has great interest not only for those 
who want to test the insights of linguistic science by practical application to 
language learning, but also for all who reflect on the great contributions which 
have been made and are continuing to be made by missionaries to scientific 
knowledge of various exotic languages. It is easy to think of the shortcomings of 
many missionary grammars, but one must not forget the positive worth, thru the 
centuries, of the linguistic work of missionaries. In some cases it has been a value 
of scarcity, that is, where we have little else on a language we have to treasure 
even a poor and limited record. But there are also important instances in which 
missionaries have produced excellent recordings or analyses, measuring up to the 
highest standards of their times. To mention just a few historical cases: Wulfila 
for Gothic, Carochi for Nahuatl, Edwards for Mohican, Byington for Choctaw, 
Kleinschmidt for Eskimo. Surprisingly Nida does not speak of these historic 
achievements. And yet, to mention them would have added much to the worth 
of his teaching, for the finest examples of the past would serve as an inspiration 
and a reminder to his missionary readers that their linguistic efforts in many in- 
stances have an importance transcending their own period of activity, that their 
apprenticeship in a language may end up being the basis for a significant perma- 
nent contribution to human knowledge. 

A consideration of historical achievements of linguist-missionaries might 
have also reminded the author of some valuable lessons of experience for their 
contemporary emulators. One important fact, which occurs to the reviewer, 
is that some of the most successful work on exotic languages was done by 
missionaries working in active collaboration with native speakers, witness 
English and Muskokee Dictionary, by Rev. R. M. Loughridge, D.D., Mis- 
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sionary to the Creek Indians, and Elder David M. Hodge, Interpreter (Phila- 
delphia 1914). Altho Nida speaks of native tutors and interpreters, he says 
nothing of the wonderful possibilities inherent in encouraging and developing the 
scientific linguistic potentialities of the native speakers. Particularly in treating 
problems of translation, he mentions the difficulties of poorly trained native 
interpreters and of the missionaries themselves; he should have gone on to show 
how a combination of talents—with the missionary principally responsible for 
encouraging the interpreter to get away from mechanical word-for-word render- 
ings and for explaining the sense of the English—can give a felicitous solution 
to this double problem. 

Nida’s handbook is for missionaries who expect to use well-known and widely 
studied languages as well as for those who hope to make their way thru relatively 
uncharted linguistic terrain. He includes a broad coverage of theoretical and 
practical phonetics, morphology and semantics, so that his readers can acquire 
the techniques of observation requisite both for improving one’s use of a language 
learned by conventional methods and for recording and analyzing a language 
without the aid of books or teachers. In handling his task, Nida is well aware of 
the phonetic, structural and semantic pitfalls, and regales as well as enlightens 
his readers with many anecdotes of awkward misunderstandings resulting from 
failure to recognize the simple but sometimes telling differences as one goes from 
one language to another. Actually, in the reviewer’s opinion, Nida has over- 
drawn the dangers and has not sufficiently tempered his portrayal by his oc- 
casional reassurances that one must make mistakes while learning and that the 
native speakers are usually tolerant and forgiving. There are too many musts 
in his recommended procedure and too many baleful warnings that given omis- 
sions will result in definite dire consequences. Such over-emphasis might not be 
too serious if each missionary were destined to come into contact wita only one 
language in his entire career, since one can hope with patient persistence to 
truly master one foreign language, but it frequently happens that a missionary 
comes into an area where, within a relatively accessible radius, several dialects 
or languages are spoken. If he is persuaded he must not tackle any language 
unless he is prepared to spend ten or twenty years mastering it, he may pass 
up the opportunity of obtaining information on the ethnography and language 
of the various neighbors of his assigned group. It would be better to encourage 
the missionary (as well as the government teacher or physician and the serious 
traveler generally) to become a skilled and inveterate observer, able and ready 
to undertake a long-term study or to record whatever he can of a language in a 
limited time, according to opportunity. Why give an unrelieved picture of diffi- 
culties when in actual fact there are many tongues which are phonetically and 
structurally simple and when, as often happens, difficulties in one phase of lan- 
guage may be compensated for by easily grasped systems in other phases? 

Following the prevailing usage among linguists, Nida discusses what Sapir 
sometimes called “brute phonetics” at length before taking up phonemics. The 
reviewer feels that this order is theoretically and pedagogically unsound, that 
it merely follows mechanically the order of development in the history of scien- 
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tific linguistics, and that it ought to be dropped both from scholarly and popular 
works. Outside of the lessons and talk of phoneticians and others influenced by 
them, pure phonetics do not exist. Instead, speech sounds occur only as instances 
of phonemes. The concept of a phoneme can easily be grasped by people who do 
not know a “phonetic” alphabet. And a scientific linguist can give a very com- 
plete and scientifically sound description of the speech sounds of a language 
without once using any symbol other than a minimum set of phonemic signs. 
(One simply says, for example, ‘“‘the p of speak is unaspirated, the p of peak is 
aspirated.”’) The traditional form of presentation and the traditional cliches 
lead Nida into the erroneous statement (142) that: “‘A strictly phonetic alpha- 
bet would indicate all these minute distinctions and many more’’—thus referring 
to the differences in the k of kill, skill, lick, lick candy, seek, sick, sake, sack, suck, 
sock, soak. But who has ever seen any phonetic alphabet with such a multitude 
of distinct signs? Every brute phonetic (i.e. non-phonemic) alphabet is a more 
or less traditional collection of signs so used as to reflect a few of the positional 
variants of some of the phonemes. There may occasionally be some temporary 
advantage gained by using special symbols for positional variants, but this 
procedure does not make an alphabet “strictly phonetic”. A phoneme sign, 
properly defined as to the free and positional variants it represents, is the acme 
of phonetic accuracy. It cannot be matched by an undefined symbol used in 
“brute phonetic” fashion. 

Nida’s generally adequate explanations and instructions on phonetic meth- 
odology omit the valuable technique of paired repetition: ‘For each contrast .. . 
select words having the given sound (in comparable position) and have your 
informant pronounce them several times in succession while you listen atten- 
tively .. .” (A Method for Phonetic Accuracy and Speed, American Anthropolo- 
gist 39.730, 1937). This is a simple and effective test for verifying or disproving 
the existence of supposed contrasts, and is also very helpful for training the 
“phonetic ear” of the novice. 

In his examples and discussion, Nida refers frequently to groups of related 
dialects and languages, but he does not include any real discussion of the kinds 
of similarities and differences one finds in related speech forms (there is one 
illustration of lexical differences in English on p. 54). An understanding of these 
matters would certainly be of great utility to many of Nida’s readers. 

While giving a fairly rounded discussion of the nature of language and linguistic 
methodology, Nida correctly reminds his readers that their language techniques 
will benefit from institutional instruction (51) and from further reading. How- 
ever, his bibliographic references are very meagre. He specifically recommends 
Pike’s Phonemics, Nida’s Morphology and a half dozen phonetic textbooks; 
Bloomfield’s Language and Hall’s Leave Your Language Alone come in for in- 
cidental mention in footnotes but only as the sources for one definition and four 
illustrative examples—in Arabic, Latin, Marshallese and Hupa. Surely a sys- 
tematic bibliography would have been a valuable addition to a book introducing 
scientific linguistic ideas and techniques to non-linguists. 

Stylistically, the book is composed in easily intelligible and readable (tho oc- 
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casionally repetitious) language. Typographically it is a fine illustration of effec- 
tive photo-offset printing from typewritten print-style copy. The uneven right- 
hand margin represents a great economy in production cost with no serious 
esthetic loss. 


Denver Morris SwWADESH 


Emite Misstr, Dictionnaire francais-roméique. A-J. Deuxitme édition entiére- 
ment refondue. xvi + 496 pp. Paris: Klincksieck. 

Despite some shortcomings, Missir’s French-Modern Greek Dictionary is an 
exceptionally good bilingual lexicographical work. The author’s daring break 
from a tradition in the compilation of Greek bilingual dictionaries which sys- 
tematically avoided representing linguistic reality is a welcome innovation which 
we hope will be followed. 

Greece is probably the only European country, with the possible exception of 
Norway, in which the difference between the official (primarily written) and the 
spoken languages constitutes a cleavage between two different linguistic systems 
with different morphology, syntax, and lexicon, although with basically the same 
phonology. The long Atticist tradition gave a certain prestige to the Kathare- 
vousa during the first decades following the proclamation of independence. In 
spite of the fight of the Demoticists and the endeavors of the entire nation to 
achieve linguistic unity, some bilingualism still exists in Greece today. The state 
and all the agencies of central and local administration, the Church, and to some 
extent the press and secondary and higher education still use the Katharevousa. 
However, the Demtic, extensively infiltrated by learned terms in its lexicon 
and somewhat modified in its morphology, has become the standard spoken and 
literary language. It is becoming the language of technology and science, and 
most likely it will soon represent the only linguistic reality in Greece. 

Missir seems fully aware of the linguistic development in Greece during the 
past decades. Although he tends to favor the Demotic, as pointed out in the inter- 
esting preface (1933) by Louis Roussel, he does not hesitate to use the learned 
terms when they unquestionably represent the linguistic reality. Nevertheless, 
the result of his lexicographical and philological investigations is not as fruitful 
as we might have expected. This is due primarily to the author’s prejudice against 
learned words used in the spoken language and to his sentimental attachment 
to the ideas of the first Demoticists as reflected in the very title of the book. 

The great number of entries and the thorough treatment of each one, com- 
bined with a genuine attempt to represent linguistic reality, make Missir’s the 
best bilingual Greek dictionary known to this reviewer. It will be very useful both 
to Greeks who study and read French and to foreign scholars interested in Hel- 
lenistic studies. However, if the latter are not well acquainted with Greek, the 
unsystematic and inconsistent listing of the Greek renderings might, in certain 
cases, mislead them. The listing of Demotic, learned, and even dialectal synonyms 
corresponding to the French entry is based neither on the rate of frequency of 
actual usage nor on the author’s preference for the most Demotic forms, but 
seems to be made in many cases on a purely random basis. This simple listing 
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of many synonyms will undoubtedly help Greek users of the book to form a very 
accurate idea of the semantic range of dispersion of each French entry. It must be 
stressed that nearly all the technical terms are faithfully and precisely rendered 
and their rendering is free of any of the author’s prejudices. Special consideration 
has been given to colloquialisms, idiomatic expressions and phrases, and even 
proverbs which are rendered by corresponding idiomatic expressions in Greek 
when that is possible. 

We look forward to the publication of the rest of this dictionary, and we hope 
that the next volumes will represent the linguistic reality even more faithfully 
and systematically than this volume does. 


Columbia University GrorGeE PapPpaGEOTES 


ALFRED Ernout & Francois Tuomas, Syntazxe latine. (2° édition, revue et 

augmentée.) xx + 522 pp. Paris: Librairie C. Klincksieck, 1953. 

A description of Latin syntax, authored by two scholars like Ernout and 
Thomas, 1unning to 454 pages of text, will be as complete as complete can be 
within its scope. The scope is indicated by the fact that the book is volume 38 
in a series entitled Nowvelle collection a V’usage des classes, and by the statement 
in the Préface (p. ix) that the authors’ aim was “donner aux étudiants un livre 
qui les mit au courant des résultats acquis dans ce domaine par plus d’un demi- 
siécle de recherches.” In this they have fully succeeded, in the usual conventional 
manner of dealing with syntax, no matter what the details to which one or the 
other linguist may take exception. 

The book is mainly meant for reference, I daresay, rather than for reading. 
Hence it is provided with no less than 65 pages of indexes, an Index des exemples, 
an Index analytique, and an Index des mots. In the text theories and opinions are 
stated without much consistent attempt to discuss critically assumptions and 
results, although references to more detailed treatments are occasionally given. 
The bibliography is concise and lists only the most important works. But all 
this is consistent with the authors’ professed aim to offer not a critical reference 
work for the scholar but an informative textbook for the student. 

The terminology used is the conventional one and there is nothing wrong with 
that: I suppose that in a Latin grammar few will be offended by the otherwise 
much decried Latin grammatical terms. But just as a revision of terminology is 
not in itself enlightening, so the users of long-established conventional terms 
should not employ nomenclature as a smoke screen in circular arguments with 
the hope, or the pretense, that calling things by names explains their nature. 
If it is said, for example (p. 18), that some verbs which are generally intransitive, 
like dolere, sperare, queri, often are followed by a direct object, then those verbs 
are, by definition, not intransitive: their being constructed with direct objects 
should not be explained away by calling them names. It is especially the long 
and often repeated terminology which lends itself to such “explanations,” though 
more recently invented terms are no less conducive to verbal legerdemain in 
what has aptly been called ‘hocus-pocus linguistics.’ 

What is especially gratifying to me as a linguist in this descriptive handbook 
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of syntax for students are two points: one, the insistence that, though Classical 
Latin does and should dominate our teaching and learning in the classroom, 
there are other much more widespread types of Latin which deserve, if not imita- 
tion in composition exercises, at least earnest consideration by all students, 
not only linguists; and two, the implicit rejection of the idea that all historical, 
retrospective and prospective, references in a synchronic description of a language 
are not only superfluous but indeed harmful and—mazima culpa!—unscientific. 

As for the first point, Ernout and Thomas emphasize the colloquial and vulgar 
possibilities of Latin syntax, but without appearing to detract from the impor- 
tance of Classical Latin. Hence the numerous examples from literary and other 
sources outside of the classical domain. 

As for the second point, concerning the historical approach in linguistics (and, 
for that matter, in sociology, anthropology etc.), I feel that, no matter how well 
and thoroughly a phenomenon may be described and committed to memory, 
true understanding can be attained only through asking and, if possible, answer- 
ing two questions such as those posed by Herbert Spencer in his First Principles: 
“How came it thus conditioned? How will it cease being thus conditioned?” 

If I have any criticism to make on that score with reference to the book under 
review, I should say that this syntax is not historical enough, even for its purpose 
as a school grammar. But there are, of course, physical limits to the size of a book. 

From experts like Ernout and Thomas one should scarcely expect grave 
omissions or errors, and no doubt the present second edition, published two 
years after the first, took advantage of all suggestions and objections addressed 
to its predecessor. All in all, then, this is a handy, well-made reference book. The 
pity is only, as I have had occasion to say in other reviews of foreign publica- 
tions, that many of our students, even graduate students in the humanities, in- 
cluding linguistics, do not know foreign languages well enough to read it and 
use it efficiently. 


University of Michigan Ernst PULGRAM 


JOHANNES Husscumip, Sardische Studien, Romanica Helvetica, vol. 41, 137 pp. 

Bern: A Francke, 1953. 

This presentation bears the detailed and informative title ‘Das mediterrane 
Substrat des Sardischen, seine Beziehungen zum Berberischen und Baskischen 
sowie zum eurafrikanischen und hispano-kaukasischen Substrat der romanischen 
Sprachen.” 

The author introduces his treatise with the postulate that since Sardinia was 
Indo-Europeanized first through the Romans, the pre-Roman vocabulary of 
Sardinian can derive only from pre-Indo-European times. This means, we may 
state in practical terms, that imbedded in the Latin vocabulary that became 
Sardinian are a certain number of terms of non-Latin, non-Indo-European origin 
which the Romans picked up from their predecessors in this insular region. 

Now comes the question: Which words are these and from what language or 
languages do they stem? 

There are no doubt some very old and some newer non-Indo-European words 
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in Latin and there is also no doubt that a number of these non-Indo-European 
words survived into the Romance period and still subsist. Such borrowings 
from superseded or from contiguous speeches are known to all periods, but where, 
as has happened, many of these ancient non-Indo-European speeches are still 
undeciphered or have completely disappeared, a determination of the etymologies 
of an apparently disparate and sporadic collection of words is a very hazardous 
undertaking, usually to be crowned with the limited reward of having produced 
a highly plausible penetration of the past and of having rendered impossible a 
disproof of the proposed solution. 

We could cite many instances of presumed lexical and even syntactical sub- 
strata attributable to languages already well known. The author of the study 
under review has no such easy base. It may be said anticipatorily that he has 
acquitted himself well in a very trying pass. An extensive time depth and great 
sparseness and inexactness of material rarefy the atmosphere in which he oper- 
ates. 

Several Romance scholars have made researches in the pre-Romance vocabu- 
lary of Spain and its possible relationship to Basque (Schuchardt). They have 
recognized African elements in Iberian; they have studied the relationship be- 
tween ancient Sardinian and the Iberian languages (Wagner). The theory hitherto 
held was that the substratum languages revert to Basque and Berber on the hy- 
pothesis that Basque is Iberian and Iberian goes back to the ancestor of Berber 
speech. The author concludes his study with the contention that his treatment 
and citations prove that a Basque and Iberian interrelationship can not be clari- 
fied by reference to the decipherment of Iberian with the help of Basque or 
Berber. The Iberians, he argues, are certainly not of African descent and Basque 
is closely related to the Caucasian languages. The author’s point is that African 
correspondences are due to a pre-Iberian substratum—the Eurafrican—or a 
radiation of Hispano-Caucasian from Spain to North Africa. 

This conclusion and the reconstruction of the history of hoarily ancient speech 
movements issues from the author’s cleverly considered and carefully documented 
correspondences of a number of evidently non-Latin and most probably non- 
Indo-European vocables labeling such concrete concepts as rock, oak tree, stone, 
certain flowers, heath, chestnut, and so on. 

Years ago this reviewer, in a paper presented before the annual meeting of the 
Linguistic Society of America, enumerated some striking phonological and syn- 
tactical correspondences of Spanish, Basque, Egyptian, French, and Celtic. No 
one was in a position to deny the correspondences nor to disprove the possibility 
of a substratum, but, on the other hand, the presenter of the paper was hardly 
able to conclusively prove the existence of the suggested substratum. 

Illustrations of the more outstanding types of Latin words delivered to us 
and which the author has treated are seen in caetra, cuscolium (Pliny), arrugia 
(Pliny). Some Romance forms examined are Spanish sarna, barranca/o, garra, 
French marelle. Caetra is defined in the dictionaries as ‘“‘a short Spanish shield” 
and the grammarian Servius in his commentary on Aen. 7, 732 had already told 
us it designated a shield “quo utuntur Afri et Hispani.’’ This, of course, furnished 
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the clue for inferring that the word was Hispanic. However, the author argues 
that Campidanian éerda, which has been explained as caetra, is a South Sardinian 
word unlikely to have first been brought to the Island through the Romans. He 
sees the form as pre-Romance Afro-Spanish (> Latin) caetra, a plant name. 
Spanish coscojo (scarlet berry of holm oak—< cuscolium) is Hispanic or Iberian 
and had an existence also in South Italy. Finally, -wliwm, the author argues, is 
not a Latin suffix. Sarna, “mange,” “‘itch’—it should be noted that Meyer- 
Liibke, REW., had already listed this as Iberian—is pre-Indo-European, as is 
shown by Basque sarra, without the n suffix [!]. Spanish barranco/a, “gorge,” 
“ravine,” formerly traced to Greek pharanz, a proposal which Meyer-Liibke did 
not quite swallow because of the phonology and geographic diffusion, rests upon 
pre-Roman *berro, “rock’’, according to the author. French marelle, “hopscotch” 
(a game with counters—disks), is based, according to Hubschmid, on *marr, 
“rock”. He includes here the hitherto unexplained Italian marrone, “big chest- 
nut”, French marron. 

Campidanian arréia, “‘pool’’, ‘“‘puddle”’, is, according to Hubschmid, insepara- 
ble from Pliny’s arrugia, of pre-Indo-European origin. The term was brought 
in by migration of pre-Indo-European-speaking peoples and diffused into upper 
Italy and Spain by Indo-European. 

An important by-observation is the author’s remark that we should reévaluate 
etymologically obscure Romance words which have German or Gaulish corre- 
spondences. 

The work continues with a cataloguing of pre-Roman suffixes and here such 
forms as Gallo-Roman cdssanus, ‘‘oak” and Spanish carrasca are discussed. 

The author urges a knowledge of the Sardinian vocabulary as of the greatest 
importance for research in pre-Indo-European Mediterranean languages. An 
extensive temporal and geographic vista is adumbrated—from the Caucasus 
Mountains to Iberian and Berber territory. Speculation can reach back to 
Egyptian and Semitic and even Sumerian. The very name [beri goes back to 
Caucasian, the author says. Archeological evidence can here serve us collaterally. 
Such is the logical and evocative conclusion of etymological studies covering 
what is in the light of known linguistic history a tremendous stretch of time. 


The City College of New York Eprxraim Cross 


Len GESCHIERE, Eléments néerlandais du wallon liégeois, Amsterdam: N. V. 

Noord-hollandsche Uitgevers Maatschappij, 1950. 

Among the definitive works in the field of Walloon serving for inquiry into the 
nature of its linguistic system is this book in which Geschiere presents alpha- 
betically his compilation of the Netherlandish borrowings into liégeois, with an 
evaluation of his reasons for assigning each element to that particular dialect. 
He follows a familiar approach in that he reveals dismay if the specific circum- 
stances of a given borrowing appear to mitigate against unequivocally ascribing 
it to its one dialect. 

May we not be in error if we seek consistently and exclusively to attribute a 
new formation to but one dialect or subdialect, to the exclusion of other lin- 
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guistic strata of influence? May it not be true that many words owe their origin 
to polydialectical influences, even while others are properly ascribed to a mono- 
dialectical source? 

Geschiere himself perceives clearly that ‘‘the very distinction between German 
and Netherlandish is, in marginal areas, indecisive’ (Geschiere, op. cit., p. 
xvi). He admits that lg. amo (‘orge hative,’ barley) may come from German; 
or may come from Netherlandish. What he does not add explicitly is that lg. amo 
may have a fourfold source: German plus Netherlandish plus limbourgeois plus 
eupenois. 

Conceiving, within the synchronic framework, the reality of certain dialect 
areas as blending almost imperceptibly into others adjacent and yet distinctive 
from the first, we trust that the future will bring at least the compendious state- 
ment of the above concept, and possibly its validation as a formally substantiated 
linguistic theory. Whether pro or contra, Geschiere’s well authenticated work will 
provide us abundant select grist for such a mill. 


Tufts College, Medford, Mass. WESLEY CONSTANTINE PANUNZIO 


A Footnote to the Review of Ropert A. Hat, Jr.’s Haitian Creole, Word 

9.292-296 (1953). 

In reviewing this book, I assumed that readers of Word would be interested 
mainly, like myself, in the structure of the language and in Hall’s interpretation 
of it. Since this review was written, however, a correspondent (Charles G. Rowe) 
has sent me a list of some two hundred errata most of which had escaped my atten- 
tion. Many of these are misprints—more or less obvious to those who already 
know both English and Creole—concerning accentuation, spelling, and word- 
division, such as ti-kouzé (for ti-kouzé ‘little cousin’), vi-ni (for vini ‘came’), 
té-ré-té (for té-rété ‘had stayed’), kiyé (for kiyé ‘spoon’), mwé-mém (for mwé-mém 
‘myself’), m-finé (for m-fini ‘I finished’), fa-miy (for famiy—also spelt fami— 
‘family’), or in English, or-tolans (for ortolans), wasked (fur asked), etc., common 
enough in all printed matter. But here they are unusually numerous; and this 
fact suggests that three accents and a hyphen may be too much for any ‘“‘prac- 
tical” orthography. Another type of apparent error has to do with presence or 
absence of nasalization. So we find both asasiné and asasiné ‘murder,’ famiy 
as well as famiy and fami, tdné as well as t6né (tonnerre), and many more. Are 
these to be regarded as free variants? Then again there are errors of translation 
(or transcription) such as déziém (bottom of p. 89) translated ‘tenth’ (top of p. 
90); either the text should read diziém or the translation ‘second’. On p. 95, pé 
bouch ou, tadé chéri is translated ‘be quiet, wait, my dear’; but tédé, according to 
the vocabulary, means ‘hear, listen’, and the phrase should be rendered ‘be quiet 
dear, do you hear.’ A few lines further on we find Epi latéma vin fini, mwé désan 
avék papa-m translated as ‘Then slowly it came to an end; I went down with 
my father.’ The “‘it”’ of the English is absent from the Creole; and without it such 
a construction is as impossible in Creole as in English. Of course, létémé should 
be latémé ‘funeral’—‘Then the funeral came to an end, etc.’ On the next page, 
yé ‘yesterday’ is translated as ‘just.’ Misunderstanding of tédé ‘(you) hear’ has led 
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to the translation, on p. 127, ‘‘Brother Malice, that little quarrel we had, has it 
ended (lit. stopped waiting)?” instead of “...is ended, do you hear?” A mwé 
(p. 129) means “Help!’’, not ‘Woe is me,” as the translation has it; and the rest 
of the sentence, translated as “here’s Malice trying to take the monkey’s tongue, 
I was a fool to go get it so far away” should read instead “ . . . tongue that I’ve 
just been so far to get.” On p. 131, sén ‘ashes’ is translated as ‘center’; and on 
p. 134, instead of A fok ou dbisiéz, translated as “Ah, do you have to be am- 
bitious,” read: A fos ou dbisiéz, and translate ‘You’re so very ambitious (that)’. 
The phrase sé nég ki fé tout b6 (top of p. 155) means ‘he’s an altogether fine fel- 
low’ or, more literally ‘he’s a fellow who’s altogether fine’, and not, as is trans- 
lated, ‘he’s a man who pretends (to be) all good.’ The enigmatically translated 
proverb on p. 193, ‘The person who tells you to put on salt in the sunshine, 
won’t help you to take it off when it rains” should be ‘People who tell you to 
put salt in the sun”—.e., to dry—‘‘don’t help you remove it when it rains.” 

The above-cited errors—some of which are reminiscent of a third-form French 
class in England—constitute only a small sampling of the total, but they suffice 
to show the dangers of dispensing with an ‘interpreter’s translation’—especially 
when one does not have mechanically recorded texts. Thus I am pretty sure that 
the first comma in “T%-fi ki pa-kéné lavé, pasé, Alé kay maémé ou’ (on p. 214) 
has been inserted in order to justify the supposed translation, ‘Little girl who 
doesn’t wash, pass by, Go to your mother’s house.”” Now in Creole, “to wash 
(oneself)”’ is usually rendered by béyé, literally “bathe”, while lavé normally 
refers to washing clothes. Moreover, lavé-pasé ‘“‘wash (and) iron” is a frequently 
employed compound; and if the suspected comma were removed the sentence 
would have the perfectly clear meaning, ‘Little girl who doesn’t know how to 
wash and iron, go (home) to your mother’s house.” 


Dominica Dovetas TAYLOR 


Josepru H. D. ALLEN, Jr., Two Old Portuguese Versions of the Life of Saint Alexis 
(Illinois Studies in Language and Literature: Vol. 37, No. 1). 68 pp. (with 
plates). Urbana: The University of Illinois Press, 1953. 

The basic prerequisite of historical linguistics is the availability of important 
source material in accurate editions, provided with scholarly aids for interpreta- 
tion and analysis. In many fields (e.g. French, English, and German linguistics) 
so much editorial work has already been done that it is easy for the student to 
take the source material for granted. So much remains to be done in the case of 
Portuguese studies, however, that it is only natural that the man who has given 
us a historical grammar of Portuguese, Provost Edwin B. Williams of the Uni- 
versity of Pennsylvania, should have urged those who were trained under his 
direction to fill the gaps. The result has been not only a series of doctoral disserta- 
tions, but also the most welcome appearance of monographs such as the present 
one. 

Leaving aside all hagiographical considerations and, except for some details 
of stylistics, most literary problems, Mr. Allen concerns himself with a diplomatic 
edition and a linguistic analysis of the text of the Alexis story as it occurs in 
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two Portuguese manuscripts, Codices 36 and 266, originating from the monastery 
of Santa Maria de Alcobaga. One of these, Codex 36, has already been edited by 
Esteves Pereira in the first volume of the Revista Lusitana (1887-9), but this 
new edition is justified both by the inadequacies of the Pereira edition and the 
desirability of de tiled comparison with Codex 266. 

The present volume offers all that could be desired in the way of paleographical 
information: the accompanying plates reproduce both manuscript texts in full, 
and the pagination of the transcribed text is made to correspond exac ‘iy to that 
of the codices. In addition, the last six pages of the prolegomena coniain a com- 
plete analysis of the handwriting, including abbreviations and other symbols, 
so that this edition will be invaluable to beginning students of Portuguese pale- 
ography. 

In dating the texts, the editor concurs with Williams in placing C36 earlier 
than C266 (thus reversing the judgment of Pereira). This fact, substantiated by 
numerous orthographical details, provides the basis for most of the study, which 
deals primarily with the changes that the text of C36 has undergone in its later 
version as shown in C266 (C266 in turn has been subjected to corrections, written 
over the original forms; since the corrections are concentrated in the early part 
of the text, the editor rightly concludes that some external circumstance hindered 
the completion of the process). Both texts, however, antedate the final loss of 
intervocalic -d-, which was not written after 1435, and C36 probably belongs to 
the end of the fourteenth century. 

In addition to listing all the details of orthography, phonology, morphology, 
and word formation that result from a comparison of the two texts, the editor 
discusses word order and stylistics, in which respect the changes made in the 
redaction of the later text all point in one direction: greater literary pretensions 
and more scrupulous choice of words. Perhaps the most interesting and most 
significant detail of the entire study is the evolution of the final nasals -om, -am 
and -do, which originally conformed to their etymological status, but gradually 
became interchanged. By means of statistics classified by grammatical categories, 
Mr. Allen shows that the interchange, starting in the imperfect ending -abant 
in C36, had spread to the preterite endings -arunt and -erunt in the original form 
of C266, whereas the corrected text of C266 shows not only an increase in the 
irregularities in the preterite ending, but also an extension of the tendency to 
three other categories, the negative nom, the nouns derived from Latin -onem, 
and the conjunction entom. Other types, such as the third plural ending of the 
present subjunctive and future indicative, still had not been affected at the 
beginning of the fifteenth century. 

The work is free from inaccuracies, except that “fifth declension” at the bottom 
of page 25 should read “fourth declension,” and in two places the turn of phrase 
may be misleading for an unwary reader: (1) On page 11, in speculating on the 
breakdown of the orthographical convention governing the use of the silent letter 
h, the author says: ‘Whether this breakdown is due to the time interval between 
the two texts or to differences of dialect between the two scribes, we cannot say 
without additional evidence.” It would seem that “dialect’”’ as such could have 
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very little connection with a purely orthographical matter, especially when the 
letter concerned is silent. (2) On page 25, in the discussion of haplology in a 
phrase, comparison is made between homé de Deus and the equivalent l’ome Deu 
in the French Vie de Saint Alexis. Of course the author has no intention of imply- 
ing that the very common Old French use of the cas régime of proper nouns to 
indicate possession has anything to do with haplology, but the occurrence of 
this example, in these terms, at this particular point of the discussion might 
leave such an erroneous impression. 

The book contains ample bibliographical references, but no glossary, which the 
editor probably considered unnecessary. This edition, in every respect definitive, 
will provide, with its exhaustive analysis, an invaluable help to everyone in- 
terested in the history of the Portuguese language. 


Brown University Water J. ScCHNERR 


Knup Toassy, Mode, aspect et temps en espagnol, Det Kongelige Danske Viden- 
skabernes Selskab, Historisk-filologiske Meddelelser, vol. 34 no. 1. 136 pp. 
Copenhagen: Ejnar Munksgaard, 1953. 

In publishing this monograph the author’s main purpose has obviously been 
to win adherents to his own descriptive method. Through the detailed analysis 
of part of the morphological system of a well-known language, Togeby, in his own 
words (p. 5), is aiming not only at giving a more objective description of that 
system, but chiefly at proving the possibility of such a method as consists in 
defining the elements of a pattern by reference to their relationships (‘functions’) 
in the context without any recourse to substance, in this case the semantic one. 
The book under review is thus meant for both non-structuralists and those 
structuralists who are reluctant to accept glossematic theories. Accordingly, 
Togeby has avoided, as far as possible, making use of the esoteric terminology of 
glossematics: nowhere are we told explicitly of the ‘immanent method’; the 
few jargon terms that appear belong to the most understandable ones. That 
even Louis Hjelmslev is not mentioned anywhere, can hardly be wondered at, 
when we keep in mind the author’s restraint from using any learned apparatus 
in this monograph, and we consider Knud Togeby’s particular position among 
glossematicians (cf. Word 9.79). 

From his own standpoint Togeby is no doubt right to disregard previous gram- 
matical research on the Spanish verb, since it may be described as non-structural 
and on the whole ‘substantialist’. It is strange, however, that Togeby should 
nowhere concern himself with Emilio Alarcos Llorach’s structural analysis of 
the Spanish verb (Boletin de la Biblioteca de Menéndez Pelayo 1949, 50-83, and 
Gramdtica estructural segin la escuela de Copenhague y con especial atencién a la 
lengua espafiola, Madrid 1951, Chap. VII, 97-126), the more so since, in the main, 
Alarcos has been proceeding according to Copenhagen descriptive methods. 
Togeby chooses, quite naturally, to operate with first-hand data: an extensive 
selection of 20th century texts, including two 1952 newspapers. 

Togeby’s first task is to identify the elements of the system and to classify 
them according to the three ‘categories’ of mood, aspect, and time (Chap. I: 
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‘Les catégories et leurs éléments’ pp. 7-10). When he only includes the ten simple 
forms (cant-dis, -abais, etc.) in the system, we should expect to be told on what 
grounds he has excluded the compound forms with haber (he cantado, habia 
cantado, etc.) which are available beside each simple one except the imperative. 
Such an exclusion, however, does not on some occasions prevent Togeby from 
handling these compound forms with some amount of inconsistency (e.g. on p. 
85 habta prestado is called a pluperfect; on the preceding one we find habta 
perdido as an imperfect, whereas on p. 66 habia sido is accounted for as a ‘perfec- 
tive’ as against the ‘imperfectiveness’ of the imperfect tense he assumes else- 
where). In fact there seems to be no real support for their being ruled out of 
the system, whereby, moreover, Togeby fails to recognize the existence of the 
whole correlation (‘category’ in Copenhagian terms) of relative past time. It 
should be clear that such an exclusion could be only supported if the ‘content’ 
of such a form as he cantado were nothing but the content of he plus the content 
of cantado. But this is not the case because he does not exist independently outside 
these compound forms and the so-called periphrastic ones (he de cantar) where 
its content is a wholly different one. As to the simple forms Togeby is dealing 
with, they are classified into the three remaining ‘categories’ (number and per- 
son are not taken into account) by means of their ‘function’ in accordance with 
the basic tenet of the immanent method. Cant-éis, -areis, -arais, -aseis belong 
together as subjunctive elements because they are found to follow querer que, 
sin que, etc., which is not the case either of -dis, -aréis, -artais, -asteis, -abais that 
follow other adverbs and prepositions, or the imperative -ad that is never found 
depending on any other word. Within the indicative mood as a whole, Togeby 
establishes the ‘category’ of time resting upon the fact that such an adverb as 
ayer needs to be accompanied by a verb ending either with -asteis or with -abais 
whereas manana occurs with -aréis or -artais. By applying this method Togeby 
also isolates the ‘categories’ of time and aspect among the subjunctive elements. 
That the difference existing between the future cantaré and the ‘conditional’ 
-aria is one of aspect, proportional to that separating the perfect -é from the 
imperfect -aba, is a conclusion liable to puzzle those who know Spanish and feel 
no theoretical prejudices on this matter. For cantaria is indeed nothing but the 
future as regarded from a past standpoint, which accounts best for its frequently 
being used as an hypothetical past tense (e. g. ayer cantartais mucho parallel to 
the hypothetical future in the present time ahora serdn las tres de la tarde), a 
fact that is not taken into consideration by Togeby when he regards the ‘con- 
ditional’ as a merely future tense. On pp. 82-84 an extensive development of 
these ideas is to be found, according to which a past tense governs (‘dirige’) the 
imperfective aspect (imperfect and ‘conditional’) in the dependent phrase. But 
it is worth noticing what Togeby himself states on p. 84: ‘les exceptions & cette 
régle sont nombreuses et inexplicables par d’autres régles, de sorte qu’en fin de 
compte il faut se résigner 4 accepter que la régle n’est pas valable en espagnol.’ 
Hence Togeby ought to draw the conclusion that the ‘functional’ definition of 
aspect he has set forth is to be rejected. As a matter of fact, such a ‘direction’ 
as the one assumed in this case by Togeby concerns not the aspect but time 
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‘category’. In anuncié el gobierno que examinarta la posibilidad| de establecer el 
jurado (Almagro 580; on p. 84) the ‘conditional’ tense examinaria is nothing but 
the future as seen from the past anuncié. In an ‘exception’ such as finalmente 
anunci6 que en la primera decena del préximo mes de junio se reunird el Consejo 
(ABC 26; on p. 84-85) the future se reunird is not governed at all, since it is 
regarded from the present standpoint of the writer (the newspaper in which this 
sentence is read is of May 29!). It thus becomes clear that, if within the indicative 
as a whole, besides a present (canto) and a past tense (cantaba/canté), a future of 
the present (cantaré) and another of the past (cantaria) are available, it follows 
that such a temporal pattern organizes itself according to two wholly different 
correlations (‘categories’): present/past on the one hand, and future/non-future 
on the other hand. This amounts to saying that in assuming only one time 
category Togeby rests, in the last instance, upon very much the same logical 
and ontological basis as traditional grammar (that structuralism is meant to 
try to overcome). Regarding aspect, it should be clear, that, if we retain opposi- 
tions as the sole means of defining linguistic units, then such a grammatical 
‘category’ is only to be assumed within the non-future past indicative between 
the imperfect cantaba and the perfect canté. For the other simple forms and the 
compound ones as well are exhaustively defined in the frame of correlations 
other than that of aspect, and therefore are undifferentiated units as far as aspect 
is concerned. 

Notwithstanding this, once he has defined in his way both aspect and time, 
Togeby undertakes at the very outset of his analysis (pp. 8-9) the definition of 
the elements. According to him, there is a neutral present (canto) surrounded 
by two aspects (the ‘imperfective’ one: imperfect comta and ‘conditional’ comerta; 
the ‘perfective’ one: perfect comt and future comeré) and two time-terms (future: 
comerd, comeria; past: comié, comia). Against such a procedure several objections 
could be raised. First of all, there is a real danger of losing sight of the facts once 
an aprioristic definition of the two aspects has been set forth, for we cannot rule 
out the possibility that the two members of a correlation other than those of 
mood and time may be defined otherwise than as ‘imperfective’ and ‘perfective’. 
On the other hand, although we are willing to agree with Togeby in defining the 
present indicative (and many other tenses as well) as neutral regarding aspect 
differences, we cannot follow him when he denies the present indicative to be a 
time unit. Such a statement is not at all in keeping with the pattern of the sys- 
tem as outlined above, and is nothing else but a result, among many others, 
of the capital fact that Togeby fails to operate with a basic distinction of struc- 
tural linguistics: the one assumed between a value as defined in the frame of 
the system and its actualizations, a distinction that corresponds to a great extent 
to the glossematic one between on the one hand invariant and, on the other hand, 
varieties and variations. For it should be clear that the grammatical notion of 
present time is liable to several psychic actualizations (e. g. the so-called his- 
torical present, not to mention other not so well known uses), and care should 
be taken not to confuse them with the actual value of that unit. 

We have purposely analyzed Chapter I at length because it elucidates—and 
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almost wholly predetermines—the manner in which Togeby carries out his analy- 
sis throughout the other three Chapters, since the ‘functional’ definition of the 
‘categories’ and their elements (Chap. II: ‘Définition des catégories’ pp. 10-105; 
III: ‘Définition des éléments’ pp. 105-117) is made on the basis of such a classifi- 
cation as established in I, one which, as we have shown, lacks any solid structural 
background. Chap. III is devoted to determining either the extensivity or in- 
tensivity of each element within each ‘category’, as is usually done by Copen- 
hagen linguists. In doing so, Togeby builds upon the preceding all-blurring de- 
scription of the ‘functions’ and moreover on several rough statistics, so that no 
reliable result might be expected. On the other hand, the distributional criteria 
Togeby applies in order to determine the extension of each element are sometimes 
puzzling enough. When Togeby defines for instance the mutual relation between 
the indicative and subjunctive moods, I wonder if the coincidence (‘syncretism’) 
of 1st and 3d singular (indic. cant-arta, -aba: subj. -e, -are, -ara, -ase) has any 
bearing either upon the extensivity or upon the intensivity of these two moods. 
As a result, the two elements of each ‘category’ are found by Togeby to have 
very much the same extension, a conclusion which (not to speak of its heterodoxy 
in relation to glossematic theory) would amount to asserting that, the present 
tense set aside, all the elements are non-marked terms, and therefore it would 
lead us to deny the existence of any correlational (categorial) organization in the 
system. Yet the conclusion drawn by Togeby is that, as against Roman Jakob- 
son’s ‘dichotomous scale’ applied to morphological patterns, ‘il vaudra mieux 
en espagnol décrire les systémes 4 deux éléments extensifs comme deux formes 
sémantiquement marquées’ (p. 117). Once Togeby has defined the extension of 
the elements, he takes up their semantic definition (Chap. IV: ‘Sémantique’ 
pp. 117-131), the ultimate goal of his research. Such a definition is wider or 
narrower according to the extension he has previously established. But when one 
reads that ‘la valeur sémantique des différentes formes verbales s’observe le 
mieux dans l’emploi libre, par exemple dans les propositions principales et rela- 
tives’ (p. 117; italics mine), one cannot help questioning whether it would not 
be advantageous to start with just a semantic analysis and spare all the pages 
where Togeby has been trying to avoid operating with semantic substance. 

It is a great pity that such a good scholar in practical Spanish as Knud Togeby 
should have been led by theoretical prejudices to such results as the ones he 
has actually reached. For, out of the frame of his theory, the Spanish texts have 
been as a rule perfectly understood, although Togeby fails to realize the clear-cut 
distinction existing in several respects between colloquial and literary Spanish, 
regarding, for instance, the use in the latter of cantara as pluperfect indicative, 
which results from an artificial romantic resurrection of the etymological Latin 
(and Old-Spanish) meaning of ama(ue)ram, and which, if it became embodied 
in the system of living Spanish, would no doubt lead towards a diachronic re- 
organization of the whole. 

I do not think many linguists will be found to agree with the results arrived at 
in this analysis of the Modern Spanish verbal system. It must be possible to 
present a more consistent and, perhaps, more convincing application of the 
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immanent method than the one Togeby is offering here, and we may hope this 
will be done. But, with such analyses as the one under review, it is difficult to 
escape the conclusion that, at best, the validity of the immanent approach to 
language is still to be proved. 


Universidad de Salamanca Martin 8S. Rurp£rez 


Luis FLérez. Lengua espafola. (Instituto Caro y Cuervo, Series Menor III). 

330 pp. Bogoté 1953. 

In addition to its valuable publications on questions philological, the Instituto 
Caro y Cuervo has produced a radio program entitled ‘‘Disquisiciones filolégicas,” 
to encourage popular interest in the language. Luis Flérez, author of La pro- 
nunciacién del espafiol en Bogotdé (1951), gave a series of talks which are here 
published as 21 separate articles. The articles are necessarily brief and are es- 
sentially a popularization of the material discussed (Filologta y lingiitstica, Las 
partes de la oracién, Gramdticos escolares, Castellano o espafiol?, etc.). While 
Fidérez gives an intelligent resume of what has been done he does not, in most 
cases, pretend to add anything new except his own observations. Of more im- 
portance are six studies on the pronunciation, the hyperboles, the Gallicisms, 
the Anglicisms, the vulgarisms, and the archaisms of the Spanish of Colombia. 
It must not be supposed that the listed examples are exclusive to Colombia; 
for example 25 of the 26 listed Anglicisms appeared in Alfaro’s Diccionario de 
anglicismos (Panama, 1950). Another article describes the importance in the 
linguistic field of Bello, Menéndez Pidal, Tomds Navarro, and Amado Alonso, 
although to title this chapter Filélogos hispano-americanos seems an error. The 
commentary-bibliography of the basic studies on American Spanish is very useful. 
The book is of interest to all and is particularly instructive for one not well in- 
formed on the topics treated. 


Columbia University M. M. Las.Ley 


Ernst Scuwarz, Goten, Nordgermanen, Angelsachsen. Studien zur Ausgliederung 
der germanischen Sprachen. Bibliotheca germanica II. 277 pp. Bern: Francke, 
and Munich: Lehnen, 1951. 

The traditional classification of the Germanic languages into East, North, 
and West, which is still taken for granted by most of our text-books and manuals, 
has been under severe challenge in recent years. The orthodox theory of three 
different parent languages for each of the groups concerned! was already rejected 
by R. C. Boer® who points out that this division constituted “geen volstrekte 
scheiding,” but “beteekent dan ook in den grond niet meer, dan dat de talen, 
die men wgerm. noemt, een zeker aantal gemeenschappelijke eigenaardigheden 
hebben.” His conclusion (§7) that each pair of geographically adjacent dialects 
shared certain common characteristics, and that they should be grouped as links 

1 “tre sprakgrenar, forutsittande havr sitt jamférelsevis enhetliga ursprak’’ (A. Noreen, 
Vadrt sprdk I, Lund 1903, 72). 

2 Oergermaansch Handboek, Haarlem? 1924, §3. 
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in a chain stretching from OHG. via Du., OS., OFris., OE., ON., Gutnic to Goth. 
rather than on the family tree pattern implicitly revives J. Schmidt’s earlier 
presentation,* and at the same time fundamentally anticipates the views of many 
later scholars. While the closer relations between Goth. and ON. and possibly 
the earlier existence of a Gotho-Norse speech community were recognized long 
ago,‘ recent discussion has concentrated chiefly on the concept of West Germanic 
unity and the contacts between Goth. and OHG. 

In the study under review Ernst Schwarz not only summarizes the results of 
recent research,® but also works out stimulating conclusions of his own. As the 
present reviewer sees it, the problem is this: The isoglosses linking two or more 
old Germanic dialects are well known and have been repeated over and over 
again with only minor variations. They do not, however, yield mutually exclusive 
groups, but constantly overlap. It is doubtful whether with the exception, per- 
haps, of ON. and OSw. any two of the languages corcerned resemble each other 
significantly more than either does a third.* The earlier investigators bent on 
fitting the facts into the family tree pattern were thus forced arbitrarily to select 
the evidence pointing in the favored direction, and to ignore or explain away all 
the rest. In view of this a genetic classification turns essentially on devising a 
method for assessing the relevancy of different isoglosses’ and the terms in which 
they are to be accounted for. 

F. Maurer® despairing of the reliability of linguistic criteria alone, calls for 
close co-operation with archeology. His proposition is to the effect that linguistic 
parallels should be considered genetically significant only in so far as they can 
be stated in terms of historical and cultural contact. In practice this apparently 
means that language classification must be made to fit the results of archeology 
and pre-history.® 

This is the method implicitly’® and consistently followed by Schwarz in the 
first part of his book." The testimony of Jordanes and the archeological finds 


3 Zur Geschichte des indogermanischen Vokalismus II, Weimar 1875, 452 f. 

4H. Zimmer, ZfdA 19, 393-462. 

5 Few relevant publications appear to have escaped his attention, e.g. B. Trnka, Syn- 
taktickdé charakteristika feti anglosaskych pamdtek bdsnickych, Prague 1925 (comparative 
study of OE. syntax); G. Langenhove, Linguistische Studién I, Gent 1936, 41-72 (study of 
Gotho-Norse innovations, mentioned by Schwarz p. 274, but not discussed); A. Martinet, 
La parenté des langues germaniques, 5éme congrés int. des ling. Rapports 134-147; H. L. 
Smith, The Verscharfung in Germanic, Lang. 17, 93-98; F. Mossé, Manuel de la langue 
gotique, Paris 1942 (esp. pp. 42-44 on the sounds of Goth. at, au). 

6 Cp. E. Prokosch, Comp. Germ. Gr., Philadelphia 1939, §5d. 

7On this cp. now J. H. Greenberg in Anthropology Today, ed. A. L. Kroeber, Chicago 
1953, 265-286. 

8 Nordgermanen und Alemannen, Bern and Munich,? 1952, pp. 5, 12-14, 65 passim. 

® The provision that, if this proves impossible, the latter are to be invited to revise their 
conclusions, is never invoked. 

10 As he rarely spells out his preliminary assumptions, but usually leaves the reader 
guessing, the reviewer begs to be excused, if he has occasionally guessed wrong. 

| He is also indebted to Maurer for much of his terminology ‘“‘neuesten Begriffsbildungen 
entsprechend”’, such as Nordseegermanen (p. 188). 
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suggest that the Goths came originally from Ostergétland and/or Vastergétland, 
and settled in West Prussia early in the first century A.D. (pp. 13-19). From 
this he infers (p. 47) that the linguistic evidence must, if possible, be treated in 
terms of a Gotho-Norse parent language which he proceeds to reconstruct. A 
detailed comparison yields the familiar list of isoglosses with a sprinkling of ad- 
ditions (pp. 144-146), while the evidence which does not fit the book is turned 
down as irrelevant” on the strength of the incessantly repeated argument, ‘dass 
das Got. eine Tochtermundart des An. ist und dass sich der Blick zuerst auf 
dieses zu richten hat’’ (p. 69). The peculiarities which Goth. shares with languages 
other than ON. or with none are said to arise from dialectal differences in Gotho- 
Norse, from leveling in similar and different directions respectively, or to con- 
stitute innovations due to the absence of ‘‘Volksgemeinschaft” (p. 68). As the 
author looks for the original home of the Goths in Ostergétland and Vastergit- 
land, not in Gotland, he also (p. 149 f.) makes light of the Gotho-Gutnic agree- 
ments. For similar reasons he denies Finnish borrowings from Goth.,!*and credits 
the emigrant Cimbrian ‘“‘Volksgenossen”’ (p. 27 f.) with having passed to the 
latter its Celtic loanwords. 

In his investigation of Anglo-Frisian the author turns to a more strictly lin- 
guistic approach inspired by the methods of dialect geography. The isoglosses 
peculiar to Anglo-Frisian, common to Anglo-Frisian and North and South 
Germanic respectively and to Fris. and Norse only, are set in chronological order, 
and new and old features distinguished (pp. 188-248). The innovations are then 
interpreted in terms of impulses spreading from a center, and an explicit rule is 
stated for determining this center: “Das Kerngebiet . . . liegt dort, wo der Laut- 
wandel am friihesten beginnt und am konsequentesten durchgefiihrt wird” 
(p. 250). Thus tracing the origin of a number of changes back to Anglo-Frisian"® 
and “Inland Germanic” respectively,” the author concludes that these were 
“aktive Sprachlandschaften” (pp. 251, 254) over against the conservative North 
(p. 257), and that even where his criteria fail new features can safely be placed 
there.!* 


12 Noreen is criticized as ‘“‘etwas fiusserlich”’ for drawing attention to some of this (p. 
143). 

13 Spacing mine. 

4 Cp. S. Bugge, Norges Indskrifter med de xldre Runer I, Oslo 1891-1903, 154-158; R. C. 
Boer, op. cit. §5, 1. 

15 The linguistic argument on é! in Finn. miekka etc. remains circular: é need not derive 
from Goth. because it also existed in Proto-Norse (p. 46). Proto-Norse is shown to have 
é < é! by Finnish (p. 55). 

16 E.g. e > i before nasal + cons., because in Anglo-Fris. this also occurs before a simple 
nasal (p. 251). 

17 E.g. the inflected inf. has a dat. only in OE., but also a gen. and instr. in OHG. 

18 In the summary (p. 267) this confident view is somewhat qualified. 

19 What exactly determines the author’s choice between Anglo-Fris. and the South is not 
always very clear. For instance, he locates in the former the doubling of consonants (p. 
263) in spite of its wider scope in OHG., cp. OE. zxcer, lédan, OHG. akkar, letttan. If the ob- 
jection were allowed that this was due to later changes in prehistoric OE., the whole pro- 
cedure would be open to any arbitrary handling, as similar arguments could be used in every 
single instance. 
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Thus we arrive at a striking and, in so far as the underlying premises are ac- 
cepted, convincing picture of the Ausgliederung* of the Germanic languages. 
The oldest differences are between the North and the South, Gothic being an 
offshoot of the former. Between the 2nd and 6th centuries A.D. a number of 
innovations mark off ‘““North Sea Germanic” as a further distinctive unit. The 
process culminates in the emigration of the Anglo-Saxons which, creating clear- 
cut linguistic boundaries, gives full independence to each of the three groups 
(pp. 267, 276). 

In the present reviewer’s opinion, one of the chief merits of Professor Schwarz’s 
book lies in his full presentation of the relevant evidence. An equally detailed 
comparison of the old Germanic larguages in phonology, accidence, and vocabu- 
lary is found nowhere else. This feature alone gives his work its great and per- 
manent value, and makes it an indispensable guide to students of the subject. 
A word index would have increased its usefulness. 

The factual material is generally reliable as far as Germanic is concerned.” 
The references to Slavic are, however, often confusing. As “altslawisch’” the 
author promiscuously adduces forms both early and late, taken from OBlg., 
ORuss., or Proto-Slav (quoted without asterisk, e.g. konegs p. 46), some even 
appear to be of his own invention, e.g. ORuss. varjaze (p. 122). We are also 
told (p. 33) that OPol. Gruzenz goes back (“‘zuriickgehen”) to Mod. Pol. Grud- 
ziqdz, which seems a rather odd way of etymologizing.* Further there are some 
loose statements of uncertain meaning, e.g. certain Germanic consonant clusters 
are said to be difficult (for whom?) to pronounce (p. 71), ‘‘Alterserscheinungen”’ 
and “Streben nach Vereinfachung” characterize the Gothic tongue (p. 101). 
The Romance languages are described as ‘‘Volkssprachen” over against Vulgar 
Latin (ib.), and “‘Ansiatze” for the much later loss of the acc.pers.pron. in Engl. 
and LowG. are ascribed to Proto-North Sea Germanic (p. 232). Note should 
also be taken of a few misleading errata, such as JF for IF (p. 46 Note 1), W. 
for H. Kuhn (twice p. 66), kein for ein (p. 68), u for g < 8 (p. 99), ding for Ding 
(p. 128), Gétland for Gétaland (?) (p. 132), éyéene for cyéene (p. 193), Dat. for Akk. 
(p. 200), as. for ae. (p. 221), weunsdag for Woensdag (p. 238). 

The author’s reconstruction of the Gotho-Norse parent language rests on the 
belief that archeology PROVES its existence,* and on the further assumptions 


20 The term is borrowed from W. von Wartburg, ZRPh. 56, 1-48. 

21T note the following oversights and omissions: Rom. guerra < Frank. (not Goth.) 
*werra (p. 67); OE. gen. sg. -es < *-asa, not *-isa (p. 72); the fem in-stems have not disap- 
peared from OE., e.g. menigeo (p. 81); ON. gre, Goth. juhiza have the superl. ¢sir, juhists 
(p. 97), ep. A. Jéhannesson, [slenzk tunga i forndld, Reykjavik 1923/24, §379 Note 3; Goth. 
wulbus corresponds to OE. wuldor (p. 131); London speech was East Saxon, not West Saxon, 
and later adopted many Anglian features (p. 288), cp. B. A. Mackenzie, The Early London 
Dialect, Oxford 1928. 

22 This word is neither in Sreznevskij, Materialy, nor in Vasmer, Russ. et. Wb. 

23 Both the old spelling and the palatalized consonant preceding q point to OPol. (prior 
to 14th century) ¢ for q; cp. S. Slofiski, Historja jezyka polskiego, Lemberg and Warsaw 
1934, §§44, 72. 

% The reviewer fails to grasp how things hidden in the ground can prove anything about 
who spoke which language. 
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that the history of a people and of its language are closely connected or even 
completely parallel, and that identical names scattered over different written 
sources refer to what is genetically and linguistically the same people. All of 
these premises obviously need some qualification, and one would have wished to 
see them circumscribed more precisely. Reasoning along the lines of the cate- 
gories of Urvolk, Urheimat, and migration, we might otherwise be forced to the 
most startling conclusions.?> Under the circumstances it is difficult to see what 
exactly is meant by pronouncements like “In Wirklichkeit waren diese spaten 
Ostgermanen simtlich Nordgermanen” (p. 186).2* Linguistically the evidence 
presented is not cogent. Of the four isoglosses which, according to the author 
(p. 147), bear the main burden of proof, two are reconstructed forms (*berainé, 
*bérjau), and the remainder may be due to chance convergence. For ggj, ggw 
this has been shown by Kuhn,” whilst @ (+a) > 6, besides resting on an ad hoc 
assumption about the pronunciation of Goth. au, is duplicated, for instance, in 
Engl. a (+2) > 6.” 

The author’s argument essentially amounts to reconstructing what is peculiarly 
Gothic into pre-lst century B.C. dialect differences between North and South 
Germanic or, where it seems more convenient, within North Germanic. Thus 
the familiar pattern of East, North, and West, pushed back a little, re-appears 
in the new setting of Gotho-Norse, non-Gothic Norse, and South Germanic as 
the earliest distinct branches of the parent language. There is no independent 
evidence of such early dialectal diversity.? 

On the other hand the account given of the rise of Anglo-Frisian is much more 
convincing as the linguistic evidence is not forced into a pre-conceived non- 
linguistic pattern. Although the issue tends to be obscured by swarms of migrant 
peoples wandering in different directions in order to suit the history of mere 
tribal names, the linguistic and non-linguistic data are, on the whole, treated 
separately, and the latter utilized only to couch the final conclusions in historical 
terms. The family tree is resolutely discarded. The study of successive innovations 
with different geographical ranges throws into relief the intermediate position 
between the North and the South which is characteristic of Anglo-Frisian, and 
also intimates particularly close connections between Anglian and ON. A number 
of charts sum up the results. Disregarding details which one might quarrel about, 
esp. the reliance placed on the Gotho-Norse isoglosses previously reconstructed, 


25 Proto-Jewish, for instance, is to be reconstructed from the languages of the present 
Jewish communities. Caesar’s Belgae, who probably spoke a Flemish-French parent lan- 
guage, founded a new Empire in 1830. The Prussians, defeated by the Teutonic knights in the 
13th century, turn up before Paris in 1870. The Welsh migrate from their original home in 
Walachia via Italy (cp. Pol. wtoch ‘“‘Italian’’), France (cp. G. welsch “French’’), and Belgium 
(ep. Du. waals ‘‘Walloon’’), and finally conquer the Cymry, doubtless descendants of the 
Cimbri who escaped from Vercellae. Cp. further criticism in O. Spengler, Der Untergang des 
Abendlandes II, Munich 1922, 189-202; H. Kuhn, AfdA 63, 4-13; 65, 53-65; 66, 45-52. 

26 Are the Americans ‘‘in Wirklichkeit’’ English, Italian, or Negro? 

27 AfdA 63, 5 f. The author invalidates his counterargument (lack of evidence for ON. 
gw > ggw) by deriving ON. skuggi < *skugwa (p. 67). 

% Luick, Hist. Gr. §507. 

29 Cp. H. Kuhn, op. cit. 5-7. 





yen 
ten 
of 


te- 
the 
hat 
ten 
ace 
hor 
né, 


10¢ 
in 


rly 
ith 
1us 
ars 


ant 


ore 
on- 
ant 
ere 
ted 
cal 
ons 
ion 
ind 
ber 
ut, 
ed, 


ent 
jan- 
the 
e in 
um 
the 
des 


IN. 





REVIEWS 103 


and the further clarification needed concerning the position of Low German 
and Dutch, the present reviewer feels that in this field the author’s classification 
bids fair to replace the antiquated concept of Proto-West Germanic, and with 
the modifications to be expected from the progress of dialect geography to re- 
main valid for a long time to come. 

One could wish the investigation to have proceeded beyond the 5th century 
and traced the many later innovations in which two or more of the languages 
concerned continue to share and which gradually change the pattern of division, 
e.g. the East Norse loss of -r in the nom.sg. parallel to the earlier development 
in the south, the generalisation of -s in the gen. sg., the use of the dat. pron. for 
the acc., and the rise of a two case declension in Sw., Norw., Dan., Low G., Du., 
Fris., and Engl., the diphthongisation of 7, 7 in Engl., Du., and High G. etc. 
Thus today most of the oppositions on which the 6th century classification is 
based have ceased to exist and new groups have arisen.*° For instance, Fris. and 
Norw. would in their morphology go with Du. and Sw. respectively rather than 
with Engl. and Icel., while the latter shares with Faroese and German its four 
case and three gender system. In view of such later parallel developments the 
term Ausgliederung as used by the author seems to call for a closer definition of 
its linguistic significance. 

Stripped of its non-linguistic paraphernalia the theory presented comes very 
close to a re-statement of the views advanced long ago by J. Schmidt, and again 
by R. C. Boer. Disagreement with A. Martinet is limited essentially to Gotho- 
German relations. What makes Schwarz’s book superior to its predecessors is 
the wealth of material he has at his command to support his conclusions. 


Yale University HERBERT PILCH 


AnpRE Moret, Anthologie du Minnesang, Bibliothéque de Philologie Germa- 
nique, publiée sans la direction de A. Jolivet et F. Mossé, Volume 13. 305 
pp. Paris: Editions Montaigne, 1949. 

This anthology includes texts, an introduction, a bibliography, notes and a 
glossary. It was compiled by Professor André Moret, of the University of Lille, 
whose work in German medieval literature is well known. 

The introduction of some 65 pages is an excellent compact and lucid general 
statement of our knowledge, almost exclusively of German and French origin, 
of minnesong, which is discussed under the following seven headings: general 
aspects (including a brief résumé of the history of minnesong scholarship); the 
problem of origins; ideas and themes (better stated perhaps: themes and motifs); 
nature in the minnesong; forms and genres; language and style; metrics, music 
and mimetism plus a concluding word about the influence of minnesong, which 
in the thinking of Richard M. Meyer was comparable to that of the linguistic 
societies of the 17th century and which Konrad Burdach likened to that of the 
précieuses. 


3° The corresponding problem in Slavic has been taken up by F. Kopeén¥, K otdzce 
klassifikacie slovanskych jazyku, Slavia 1951. 
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The bibliography is compact and selected with a view to the poets and poems 
included in the text. 

The text embraces approximately 200 minnesongs from the early 13th century 
anonymous innovators, e.g. “floret silva nobilis” from the Carmina Burana to 
“the autumn of minnesong,” e.g. Henry von Miigeln, at the end of the 14th 
century. Walther von der Vogelweide is appropriately accorded nearly 85 pages 
of text for the wide and representative selection from his poetry. 

Proceeding from his basic assumption that minnesong is among the more com- 
plex questions of German literature (significantly this complexity is attributed 
to the lack of precise chronologies, of an inadequate understanding of the literary- 
musical character of the art of the minnesong, and of an equally inadequate com- 
prehension of the éfats d’ame), the author has admirably succeeded in reducing 
the complexity by succinct summary and sound evaluation. These in combina- 
tions with his representative selection of poems, and the appended study ap- 
paratus should afford French-speaking Europeans an excellent introduction to 
the study of German minnesong. 


Wayne University Harowtp A. BAsILIus 


ARNE RupskoGeErR, Fair, Foul, Nice, Proper: A Contribution to the Study of 
Polysemy. Gothenburg Studies in English, I. xi + 505 pp. Stockholm: Alm- 
qvist & Wiksell, 1952. 

In this volume, the first of the new series of English monographs from the 
University of Gothenburg, Dr. Rudskoger presents a detailed historical study of 
the four polysemic adjectives which are named in his title. Inspired partly by 
the work of his former teacher, the late Gustaf Stern,! and partly by Robert J. 
Menner’s article “Multiple Meaning and Change of Meaning in English,’ he 
has set out to provide an answer to the following question (p. 8): “If a word has 
two or more senses, which exist side by side with unchanged spelling and un- 
changed pronunciation, and these senses are employed in the same social sphere 
and in similar styles of speech or literature, and are used in the same or similar 
syntactical functions and phraseological surroundings, do or do not these senses 
tend to disturb or even destroy each other?” The affirmative answer given to this 
question on pp. 489-490 is supported not only by the full historical treatment of 
the four adjectives fair, foul, nice, and proper (Chapters III-VI), the announced 
subject of the book, but also by two other less complete sets of data: (1) a brief 
survey in Chapter VII of the sense-development of twenty-four polysemic 
adjectives drawn from various spheres of notion (the first four, admirable, bright, 
crazy, and cunning, are typical of all), which is designed to widen the scope of 
the study and to permit more specific conclusions than would be possible from 
the four adjectives alone; and (2) in Chapter VIII, a series of tables containing 
120 adjectives beginning with s-, recorded in the Ozford English Dictionary 
before 1500, for which, with the aid of the OED, the author has statistically 
studied the percentage of loss of senses and sub-senses. 

! Particularly by Stern’s early book Swift, Swiftly, and Their Synonyms (Gothenburg, 


1921), also by his monumental Meaning and Change of Meaning (Gothenburg, 1931). 
2 Language 21.59-76 (1945). 
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In my opinion, the materials presented in Chapters VII and VIII contribute 
little to the book and might better have been omitted. The conclusion drawn by 
the author from the statistical tables in Chapter VIII, that ‘‘loss of senses and 
sub-senses does occur even in words otherwise fit for use and that it occurs to a 
fairly great extent,” has been known for a long time and from much more ex- 
tensive evidence than is presented here. The analysis of the twenty-four adjec- 
tives in Chapter VII is inconclusive because it is incomplete and impressionistic. 
Full historical studies of these words, as well as of other English polysemic 
adjectives, following the pattern of Chapters II-VI, should be available before 
any final answer is attempted to the question quoted in the first paragraph of 
this review. 

The last three chapters are devoted to general historical discussions: Chapter 
IX to possible reasons for the loss of a whole word, Chapter X to possible reasons 
for the loss of one or more senses of a word, Chapter XI to a very brief con- 
clusion. It is not easy to see why Chapter [IX has been included, since none of the 
twenty-eight adjectives studied in Chapters III-VII has been entirely lost from 
the language, and since Rudskoger’s materials represent only a slight enlarge- 
ment of what has already been set down by earlier scholars in this field (Hemken, 
Oberdérffer, Offe, Teichert, Menner, and Miss Williams, to mention only those 
who have concerned themselves with the loss of English words). The generaliza- 
tions in Chapter X are, as has been suggested above, premature in view of the 
present state of our knowledge. 

The fact is that in this book the author has tried to do two different things, 
neither of which he has achieved completely. That he himself is aware of this 
lack of unity in his work may be inferred from the opening words of the last 
chapter (p. 488): “On one hand, I wanted to treat a more special and restricted 
problem, viz. the history of 4 common Eng. adjs., fair, foul, nice and proper. I 
wanted to try and map out their sociological position in the broader sense of 
that word . . . On the other hand, however, I wanted to treat a problem of a more 
general nature, which problem became the chief one to me. I wished to find out 
whether and to what degree polysemy, i.e. multiplicity of meaning, in a word, 
was of importance as causing loss of sense or senses . . . I had to proceed in two 
different ways, which were somewhat difficult to reconcile.” This difficulty is 
also felt by the reader, who is puzzled by the unusual progression from the specific 
to the general and is uneasily aware that the points of greatest emphasis in the 
book are not the most productive of specific results. 

Now that these negative criticisms have been set down, it is a pleasure to say 
that the historical investigation of the four adjectives fair, foul, nice, and proper 
has been admirably carried out and will remain a valuable contribution to the 
historical semantics of English. The author’s methodology, presented in the first 
two chapters, is in several respects an advance on that of previous workers in this 
field of linguistics. The principal innovation is the use of the term “notion-field,” 
an adaptation of the German “Begriffsfeld” first used by Jost Trier more than 
twenty years ago.’ In Rudskoger’s terminology notion-fields are areas of meaning, 


3 Der deutsche Wortschatz im Sinnbezirk des Verstandes (Heidelberg, 1931). 
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much like the categories in Roget’s Thesaurus but less extensive, overlapping 
each other and not so sharply defined as the “senses” which we find in the dic- 
tionaries. For example, for the adjective fair he recognizes sixteen notion-fields, 
ranging from “physically beautiful,” which is the oldest (c888 in the OED) down 
to “complete, sheer” (1872 in the OED). Each of the subjective, individual 
meanings of a word is smaller than a notion-field and moves about within it 
(for example, fair in the sense ‘‘physically beautiful’? may mean one thing to one 
person and something else to another, but the notion-field is the same in both 
cases), but each of the objective meanings covers a complete notion-field and 
contrasts with the other notion-fields within which the word moves. Change of 
objective meaning, then, involves the extension of a word to new notion-fields or 
its withdrawal from notion-fields in which it has previously moved. In most 
cases the objective meanings of a word in past periods of time (that is, the notion- 
fields) can be ascertained, but the individual fluctuations within a notion-field 
are ordinarily beyond the reach of historical investigation. 

In the historical study of meaning and change of meaning in nouns and verbs 
it is notoriously difficult, in the absence of formal criteria, to avoid subjective 
interpretation of the data. In a study of adjectives, however, some formal criteria 
are possible. Of particular importance in Rudskoger’s study is the classification 
of adjective uses according to the “determinants,” that is, according to the nouns 
or other words which the adjectives modify. For example, in fair woman, fair 
weather, and fair play three different notion-fields are involved, and in each case 
the determinant facilitates the assignment of the adjectival use to the proper 
notion-field. 

In these as in other respects, the chief merit of Rudskoger’s work lies in his 
greater use of formal criteria as compared with most of his predecessors. The 
dependence on “psychic” and “logical” criteria, which we find even in the work 
of Gustaf Stern, the distinction between ‘intentional’ and “unintentional” 
changes of meaning, and other features of the older methodology are noticeably 
lacking here, and we may hope that they will be similarly lacking in future studies 
in this field. Rudskoger’s historical studies of fair, foul, nice, and proper may 
well serve as models for other investigators of meaning and meaning change, not 
only in English but in other languages as well. It is a pity that (“for reasons of 
space,”’ as he tells us) his study of the adjective plain had to be omitted; but no 
doubt he will be able to publish it elsewhere. 


Columbia University Exuiort V. K. DoBBIE 


A. Couen, The Phonemes of English: A Phonemic Study of the Vowels and Conso- 

nants of Standard English. 127 po. The Hague: Martinus Nijhoff, 1952. 

The phonetics, traditional or functional, of Standard Southern British has 
by now been quite well thrashed out by Daniel Jones and others, and such a book 
as Cohen’s cannot aim at anything but restating old data in a new fashion. Here, 
once more, the language is a pretext for theoretical arguing and the book is to 
be considered as a contribution to structural theory rather than to English studies. 

In this Amsterdam dissertation, the author is obviously intent upon affording 
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proofs that he has considered and evaluated all the relevant literature. The re- 
sult is that, at first sight, he appears as an eclecticist, adopting some features 
from one theory, other features from another, using at every point one approach 
or one author against the others. Like the Bloomfieldians, he classes his phonemes 
on a purely distributional basis. His choice of the semantic function as the basic 
criterion in establishing the phonemic inventory is reminiscent of Prague theory 
and practice. His rejection of phonetic criteria for the identification of phonemes 
is a tribute paid to glossematic ideals. Yet Cohen’s haggling and wriggling has 
ultimately a very surprising result, not only overlooked by the reviewers of the 
book, but of which the author himself seems to have been unaware: IT FILLS THE 
LAST HOLE IN THE PATTERN OF STRUCTURAL DOCTRINES. If, after Martinet (An- 
thropology Today 574-586), we classify these doctrines by reference to their 
attitude toward the two substances (expression and content), we will find Prague 
operating with both phonic and semantic substance, Glossematics rejecting both, 
Bloomfieldians using sounds (phonetic similarity) but rejecting meaning. There 
remained a last theoretical possibility, opposite to the Bloomfieldian approach: 
rejection of sounds but adoption of meaning, which nobody had thought or cared 
to put into practice, and this is precisely the one into which Cohen has finally 
stumbled. 

One of Cohen’s main preoccupations must have been to convince his readers 
that his obvious eclecticism is never detrimental to the rigor of his approach. 
He is clearly convinced that substance and rigor are incompatible and his, on 
the whole, half-hearted concessions to meaning are always accompanied, if not 
by a translation into behavioristic terms, at least by a behavioristic profession 
of faith. He has not seen that, in these things, the differences between Prague 
followers and Bloomfieldians are purely verbal: whether we say that A ‘‘means” 
something different from B, or that A elicits a different response from that 
elicited by B, we refer exactly to the same thing or, at least, to the same phonemic 
procedure. It is certainly useful to distinguish in theory between ‘referential’ 
and ‘differential’ meaning, but it should be clear that phonemicists of all schools 
have always operated with differential meaning. The reference to ‘‘intellektuelle 
Bedeutung”, so common in early Prague phonemics and against which Cohen 
polemicizes, never implied referential (as opposed to ‘differential’) meaning but 
was meant to exclude semantic-phonic deviations of affective origin from what 
was called later the commutation procedure. There can be no doubt that, if 
meaning is deemed detrimental in phonemic analysis, the only solution is its 
total rejection as proposed by B. Bloch. One may disagree with it, but it remains, 
that in this respect, it is the only one that goes beyond the theory and practice 
of Prague phonemics. 

Seduced as he is by “‘purely linguistic’’ criteria, Cohen could not refrain from 
attacking Van Wijk’s highly mentalistic approach. However, he does so rather 
awkwardly when he takes exception to one of Van Wijk’s statement according to 
which language is “the collective possession of a community, living in the mind 
of each member as a system of a certain structure”. This does not at all mean, as 
Cohen maintains, that “we should have to investigate what goes on in the mind 
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of speaker or listener’”’ (14). It just means that Van Wijk refused “to conceive of 
a language as nothing but a body of actual utterances ... because... the ut- 
terances are not ‘the language’ but only the outward manifestation of it.’’ (Mar- 
tinet, tbid., §1.13). The real question in connection with Van Wijk’s views on 
phonemics is whether an appeal to ‘linguistic feeling’ is the right procedure for 
determining what the phonemes of a language are. Of course, it is not because the 
speakers ‘feel’ some feature to be relevant that it should be listed as such; on 
the contrary, we have to assume that it is ‘felt’ relevant precisely because it is 
distinctive, as shown by commutation or by other objective techniques of 
phonemic identification. But since speakers have, throughout their lives, to 
keep distinct the words and morphemes of their language, they may well have 
something like a ‘feeling’ of what is phonemically relevant. We might define a 
‘linguistic feeling’ as A SET OF NON-FORMULATED (OR IMPLICIT) JUDGMENTS OF 
LINGUISTIC EXISTENCE, which would give it a legitimate operational status. Con- 
ceived as such, what we usually call a ‘linguistic feeling’ appears to be nothing 
but the condensed result of a series of judgments based on objective linguistic 
data. Long experience and every day practice of his own language enables the 
native speaker to perform automatically (I mean mechanically and very rapidly) 
a series of linguistic judgments, the result of which he naively considers as a ‘feel- 
ing’. Of course, even this rigorized type of ‘linguistic feeling’ cannot be con- 
sidered a sound criterion in phonemic analysis; it may however prove an ap- 
preciable help as a symptom leading to the discovery of objective criteria. It 
will be the task of the linguist to guide the subject in his effort to discover the facts 
and make explicit a judgment ultimately based upon these facts, a judgment 
he naively mistakes for a subjective feeling. This can be safely done by questions 
eliciting yes-or-no answers, progressing step by step from the more general 
toward the very particular, somewhat as in the Twenty Questions Game. In 
any case, granted that linguistic feeling can never be a criterion, it may prove, if 
correctly handled and rigorously checked by reference to objective data, a most 
useful aid in descriptive practice. 

If rigor is obviously Cohen’s ideal, he is still very far from it in his handling 
of theoretical problems, where hearsay, verbum magistri, and subjective prefer- 
ences are at times discernible ingredients. His chapter ‘Phonemes and their 
Interrelations’ is most characteristic of a looseness often camouflaged behind 
verbal clouds. The way the doctrine of relevant features is laboriously juggled 
away (30-34) proves that the author is confusing the operation whereby the 
linguist determines which phonetic features are relevant for a given phoneme, 
with the actual functioning of the language when normally spoken. According to 
him, there should be ‘“‘no permanent set of distinctive qualities, functioning in 
speech, but rather a permanent set of marking qualities, of which now one 
quality, now another is relevant or distinctive.” (31). Does he mean that when we 
say ‘go get the pail’ all the marking qualities of /p/ are relevant because we might 
have expected to hear bale or mail instead of pail, but that, if we mention ‘pisca- 
torial techniques’, none of those marking qualities are relevant because neither 
*biscatorial nor *miscatorial exist in the language? If he does, then he forgets 
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that both the p of pail and that of piscatorial will easily be identified by an in- 
terlocutor before the rest of the word is pronounced. What is important, for the 
phonemicist, is not that * biscatorial and * miscatorial do not exist, but that 
according to the syntagmatic rules of phonemic combination they are possible 
in English. If Cohen does not agree, the only solution lies in the corpus approach. 
Unfortunately, even this emergency issue is blocked: his analysis of English 
phonemics is Not based on a definite corpus. 

The real merit of Cohen’s enterprise is that it represents an effort to confront 
the various existing methods of linguistic analysis and thereby to take the first 
step toward the establishment of a terminology which adherents of all schools 
and trends could at least understand and use in discussions with their scientific 
opponents. In this reviewer’s opinion, the method proposed in this book will 
hardly gain Cohen supporters or followers. However, having for the first, and per- 
haps last, time combined in phonemic analysis the rejection of phonetic criteria 
with the utilization of semantic function, he has secured for himself a little 
corner in a future history of modern linguistics as the lonely filler of the last gap 
in the pattern of structural methodology. 


University of Washington ALPHONSE JUILLAND 


Joun SAMUEL Kenyon, American Pronunciation. Tenth Edition, Second Print- 

ing. x + 265 pp. Ann Arbor: George Wahr, 1951. 

American Pronunciation is not the new book usually reviewed in scholarly 
journals, but quite the contrary an old well-known work, long become a classic 
in its field. 

As such, the book has enjoyed enough success to have gone thru ten editions, 
including one major and one minor revision since its initial publication in 1924. 
The edition under review is of course the tenth. 

Thru the years, Kenyon’s work has been an invaluable weapon in the hands 
of those speech teachers who have sought to free their field o/ the stifling influence 
of the “elocutionists’” and their pedantic, pseudo-“Eastern American” speech 
standard. At the time that the book was first published, it was an especially 
important task to establish the equal antiquity, and therefore equal respecta- 
bility in the eyes of many, of American English with British English, and to 
fight for the incorporation of phonetic training in the speech curriculum. The 
scholarship of Dr. Kenyon put a linguistic floor under the teaching of speech 
in American schools and colleges, and demonstrated thru both historical and 
descriptive techniques the true nature of American speech types cultivated on 
American soil by real speakers, where before there had been only the amorphous 
dogma of the “esthete” and the dilettante. 

One cannot but marvel at the encylopedic nature of this small book with its 
detailed physiological descriptions of American English sounds, discussions of 
spelling-pronunciation, historical phonology, assimilation, stress, and other 
phonetic problems. As a book intended not for linguistic scholars, but for class- 
room use, there is much that is excellent and can contribute mightily to the 
knowledge of every student of American speech. 
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We can only regret, therefore, that American Pronunciation has failed to keep 
pace with the giant steps taken by linguistic science in the past quarter century. 
There has been no serious attempt to bring the book truly up to date in matters 
of phonemics and of structural linguistics; there have been only minor changes 
of detail and organization. 

For example, no effort has been made to separate linguistic history from 
linguistic description. These two levels are mixed everywhere in the book. In a 
context which is clearly descriptive of the general distribution of shwa in modern 
English, we are told that the [a] in the second syllable of postman is descended 
from the /a/ of Old English /man/. Elsewhere, the form ’em is traced back to 
earlier English hem, which was replaced in stressed position by Scandinavian 
them. The identification of ’em with them is termed a “‘mistaken belief.’’ Of course, 
such an identification may be a historical or etymological ‘“‘mistake” but in 
modern English it is no mistake at all. 

In the discussion of assimilation, altho examples are distinguished as to func- 
tional level (formal, colloquial, standard, sub-standard, etc.), no such distinc- 
tion is drawn between diachrony and synchrony; all assimilations are lumped 
together: such cases as [barm ba] by and by, [1t kemp bi dan] it can’t be done, and 
[rezbert] raspberry are cited without identification, altho the first two are clearly 
illustrative of present-day dynamics whereas the last one is for modern English 
merely a fossilized form which Kenyon would not even be able to cite if it were 
not for the spelling. 

Kenyon is known also for his Guide to Pronunciation, written for the Merriam- 
Webster dictionaries, and A Pronouncing Dictionary of American English co- 
authored with the late Thomas A. Knott. All of his work contains a wealth of 
material and, consequently, food for thought for every Anglicist. It is unfortunate 
that more American linguists working in American English phonemics have not 
taken advantage of these labors, for the material, tho rich, is raw and fairly 
begging to be digested. 


The City College of New York MarsHAL.L D. BERGER 


J. Aupen Mason, The Language of the Papago Indians, iv + 84 pp. Philadelphia: 

The University Museum, University of Pennsylvania, 1950. $1.00. 

The dialects of Papago shade off into those of Pima, the distinction between 
Pima and Papago being cultural rather than linguistic. ‘Papago and Pima are 
coterminous. The Pima are the “River People,” the Papago are the “Desert 
People” ’ (p. 4). Papago-Pima constitutes one of the two extant languages mak- 
ing up the Piman family, the other being Tepecano-Tepehuan. Southern Tepe- 
huan is thought by Mason to be probably close to Tepecano, more or less as 
Pima is to Papago, but this matter and the relation of some extinct Piman dia- 
lects need further study. The whole Piman family has a large territorial spread, 
extending about 1000 miles north to south in Arizona and northwestern Mexico. 
This extension is interrupted for considerable distances by Cahita, Tarahumara 
and other Uto-Aztecan languages, only distantly related to Piman. Piman 
speakers today number several thousands in scattered villages and reservations. 
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Piman is one of the families making up the Utaztecan branch of Aztec-Tanoan. 
It is (p. 5) ‘the most aberrant of the ‘Utaztecan families.’ A further comment on 
the makeup of Utaztecan involves a considerable omission, probably a copying 
error: ‘According to Whorf this is divided into Taracahitian, Aztecoidan, and 
Piman families.’ The full list of Utaztecan families would include also Cora- 
Huichol, Luisefio-Serrano-Gabrielino, Hopi, Mono-Paviotso, Shoshoni-Co- 
manche, Ute-Chemehuevi. 

Mason’s monograph, prepared for publication in 1941, but not published until 
1950, gives the results of intermittently continued research going back 30 years 
to about 1911. It was A. L. Kroeber who began the study, and much of the col- 
lecting of raw data—texts and morpheme lists—was done by Juan Dolores, a 
Papago Indian, who was trained by Kroeber both to collect and to analyze forms. 
Mason took over the work in 1919, after working on the related Tepecano; 
he spent a few weeks in the field and afterwards worked up the material as- 
sembled by all three men. In the thirties, George Herzog made a musicological 
and linguistic field trip among the Pima, and Mason subsequently arranged for 
him to go over the Papago monograph, making observations and additions. Thus 
this study is the collective product of four people, one of them a native speaker 
of the language and the other three linguistically trained scholars. The result is 
a rounded, careful treatment of the grammatical structure, marred by some in- 
sufficiencies in the analysis of phonetics. 

Mason describes the inflections, which include singular versus plural in the 
noun and in the verb, and extensive versus punctual aspects in the verb; and 
he gives a classified list of affixes. For each inflection and for each affix, he gives the 
rules of formation and the phonological changes which take place, along with 
extensive lists of examples, which either are illustrative or attempt to be ex- 
haustive, according to the needs of each case. For every affix, he states clearly 
the type of element to which it is attached, the class (whether noun or verb) of 
the formation, the meaning, the frequency of occurrence, restriction of usage, 
and other pertinent matters. His treatment, based on Dolores’ word lists and 
many pages of text and checked with other scholars, impresses one as con- 
scientious and well-conceived. 

Mason indicates (15) that word boundaries are not easily established, and that 
the phrase is probably the real phonetic unit. By the same token, there is no 
hard and fast distinction between affix or enclitic and independent words. There 
are, however, perhaps forty or fifty elements which are always or nearly always 
unaccented segments of complexes built around other more freely occurring 
forms, especially nouns and verbs; the unaccented elements can conveniently be 
treated as affixes. (Many of them, in meaning and usage, are reminiscent of weak 
elements—pronouns, negative, prepositions, articles—in French.) Some ele- 
ments, which are very much at home in affix-like usage, are also found to occur 
independently, for example ?am, the independent equivalent of ‘there’, also used 
in postclitic position to express general location, e.g. kii?am ‘at the house’. Another 
interesting example of varying function is the relation of postclitic-wam ‘with’ 
or ‘accompanying’, the verbs wana?i ‘put with’ (-na- fossilized causative, as 
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suggested by the cognate element in Tiibatulabal) and wama?t ‘assist’, and the 
noun wamahkam ‘companions’ (-hkam regular agent suffix). Again, the first person 
singular prefix 7i7- is also found suffixed to a demonstrative element to form the 
independent pronoun ?aa7ii, and there are partial or complete parallels for other 
pronouns. Some verbalizing suffixes resemble independent verb roots in having 
two separate forms for punctual and extensive, but verb compounding (or verb- 
plus-verb construction) is not otherwise regular. An interesting detail, reflecting 
the separateness of elements in still another way, is the fact that the punctual 
aspect is marked not only by a change in the verb stem but also by a suffix, -ht, 
added to the pronoun in a complex which may either precede or follow the verb. 

Plurality in both the noun and the verb is expressed by initial reduplication 
of the stem, with the accent falling on the first part of the reduplication. In the 
verb, stem-expressed plurality applies to the subject of intransitives but to the 
object of transitives, or optionally to repetition of the action in either type. A 
few verbs have suppletive plurals, employing totally different stems for singular 
and plural. 

The two aspect forms of the verb stem are generally differentiated by their 
length, the punctual being generally shorter by a consonant or by a whole syl- 
lable, but there are also several interesting variant formations applying to given 
stems. The differences doubtlessly reflect the morphologic and phonologic pre- 
history of the Piman language family. Presumably the extensive reflects the 
fullest original form of the stem, preserved by the influence of suffixes which 
have since been lost, while the punctual form represents the fate of the unpro- 
tected stem final. 

Mason’s analysis of the phonetics seems to embrace one non-existent contrast, 
evidently conceived on the basis of historic and descriptive phonology and in the 
face of the actual phonetic evidence. He writes (8): ‘The difference between surd 
and sonant stops and affricatives is far less in Papago than in English, and a 
careful ear is needed to avoid confusion. My equivocal renderings are plentiful. 
Most previous recorders of Papago and Pima have not recognized the sonant 
stops and have recorded all as surds. . . .’ Earlier, in his introduction to Dolores’ 
Papago Nominal Stems (University of California Publications in American Ar- 
cheology and Ethnology 20.20, 1923), he wrote, ‘The question of the sonant 
stops and affricatives is a most difficult one. As in Tepecano, the difference is 
very slight and confusion certain. The editor was frequently at fault in his 
renderings while the author was never fully convinced of the existence of sonant 
stops. Taught at first to write all as surds, he was never able accurately to dis- 
tinguish between them. However, the surd stop is always preceded by a voiceless 
sound, generally either the aspirate h or a voiceless vowel, while the sonant 
stop is as invariably preceded by a voiced sound or a glottal stop.’ Mason should 
have been wary of a supposed distinction which a native could never be fully 
convinced of, and he was unduly unconcerned about his own difficulty in dis- 
tinguishing the sounds. A consideration of the evidence as stated by Mason 
himself should indicate the real state of affairs, and the solution to be given now 
was further confirmed by the reviewer recently thru brief contact with Papago 








the 
on 
the 
her 
ing 
rb- 
ing 
ual 
-ht, 
rb. 
ion 
the 
the 


lar 


1eir 
syl- 
ven 
re- 
the 
‘ich 


ast, 
the 
urd 
da 
ful. 
ant 
res’ 
Ar- 
ant 
e is 
his 
ant 
dis- 
less 
ant 
uld 
ully 
dis- 
son 
10W 
ago 





REVIEWS 113 


speakers. The true phonemic analysis agrees with that originally worked out by 
Kroeber and Dolores. There is only one phonemic series of stops, with very 
minor phonetic differences entirely correlated with the preceding sounds. In 
medial position one finds a contrast between consonant sequences of the type hp 
and simple stops without preceding aspiration. This contrast cannot occur in 
initial position, but the initial stop of some elements takes on a preceding h 
whenever the element occurs in non-initial position. The word or phrase medial 
contrast of hp and p in Papago-Pima corresponds to the contrast of p and b 
in Tepecano. Altho Mason compares the difficulty of distinguishing the two 
supposed types of stops in Papago with the situation in Tepecano, an examina- 
tion of his monograph on the latter language ‘“Tepecano, a Piman language of 
Mexico,” Annals of the New York Academy of Sciences 25.309-416, 1917, esp. p. 
317) indicates that the analogy is not particularly significant, since in Tepecano 
he found it difficult to distinguish only in final position. 

Of course, to have a full account of a morpheme it is necessary to know the pho- 
netic form it will take in medial as well as in initial position. And, in this respect, 
it will indeed be necesarry to supplement Dolores’ record, for (p. 8) ‘. . . Dolores 
generally wrote all words in their independent phonetic forms.’ 

The phenomenon of pre-glottalization, which Mason discusses in contrast to 
pre-aspiration, would seem to be partly a matter of a contrasting type of con- 
sonant cluster and partly a matter of over-recording an automatic phonetic 
feature which is not independently significant. 

Papago has several instances of limited phonemic contrast, involving pairs of 
phonemes which are largely in complementary distribution but occur as real 
contrasts in a certain number of cases. Thus, the contrast of ¢ versus ¢, of s versus 
§, of 7% versus n, of | versus r. As may be seen both in the descriptive phonology 
of Papago and by comparison with Tepecano, these contrasts come from re- 
cently developed phonetic variations depending on the following vowel, overlaid 
by still more recent phonetic changes in the vowels which disturb the mechanical 
character of the positional variation. 

The interrelated phonomena of final aspiration, devoicing and the phonemic 
status of the central high vowel (a) present some interesting problems. Final low 
vowels (a, 0) and final accented vowels, both high and low, are generally followed 
by aspiration. It is possible that there is a weak aspiration, representing a me- 
chanical feature of sentence final, and a stronger aspiration, constituting the 
phoneme h. The high vowels 7 and u are voiceless when unaccented and final or 
in the position between two obstruent consonants (p t ¢ k s § h). The high central 
vowel (a) follows similar rules, if one recognizes this vowel to exist in positions 
where Dolores occasionally but inconsistently wrote it and where Mason con- 
sidered the feature to be mechanical. As the latter explains (p. 10), ‘In trans- 
cription final unvoiced E has generally been ignored, as it is presumably indis- 
tinguishable from pure aspiration without vocalic timber after final surd 
consonants. The same applies to voiceless E between two surd consonants; it 
probably represents p’ire aspiration.’ There are then two possible interpretations: 
(1) to regard the voiceless central timbre as a positional variant of a or (2) to 
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regard it as a mechanical feature of consonants in certain positions. Perhaps the 
two interpretations are equally valid. The second gives a more economical trans- 
cription, particularly in view of the fact that these central-timbre aspirations 
are very frequent. The first one gives a simpler pattern of syllable structure and 
greater uniformity among the high vowels. However, the parallelism of the high 
vowels is not complete. For one thing, unaccented a, if we so classify it, is far 
more frequent than either 7 or u. For another thing, devoicing of this a occurs 
in some positions where 7 and u keep their voicing. However, the precise rules of 
devoicing need further study. 


Denver, Colorado Morris SwADESH 


Guapys A. Reicuarp, Navaho Grammar, Publications of the American Ethno- 

logical Society, No. 21. xiii + 394 pp. New York: J. J. Augustin, s. d. 

Nous avons enfin une grammaire exhaustive de la langue la plus importante 
de la famille athabasque. Cette famille de langues, avec ses catégories grammati- 
cales si originales et si nettes, n’intéresse pas seulement les américanistes et les 
théoriciens de la linguistique, mais aussi les orientalistes puisque c’est un des 
derniers groupes linguistiques 4 avoir franchi les détroit de Behring, et qu’il 
faudra en tenir compte pour reconstituer les catégories grammaticales perdues 
des langues sinotibétaines. 

Pour le lecteur européen cet ouvrage offre une agréable surprise: on peut le 
lire sans avoir en maine une table de logarithmes et sans connaitre les mystéres 
de l’analyse factorielle. Un tel manque de conformisme venant du Nouveau Monde 
étonne toujours. Peut-étre cependant pensera-t-on que |’auteur a été trop loin 
dans sa réaction contre le jargon prétentieux et la rigueur factice. Un ouvrage de 
cette importance doit se suffire 4 luiméme. Or, la description phonologique est 
insuffisante si l’on n’a pas sous la main (c’est mon cas) la Navaho Phonology de 
Hoijer. Il était certes indispensable que |’auteur indique ses divergences théo- 
riques et pratiques avec Hoijer, mais pourquoi n’a-t-elle pas explicité les nom- 
breux points d’accord et confirmé, par exemple, que / et 7 (2) ne se trouvent a 
Vinitiale que dans les emprunts & |’espagnol? 

Le chapitre intitulé “Phonology” (13-45) traite, semble-t-il, des alternances 
morphonologiques, et il n’apparait pas que combinaisons impossibles et neu- 
tralisations aient été signalées nulle part. Un chapitre intitulé “The Word” 
(46-55) traite des différences et des ressemblances entre le nom et le verbe, et 
montre que les adjectifs ne sont qu’une variété du verbe, de sorte que le chapitre 
sur l’adjectif ne contient que des indications sur |’expression de la comparaison 
et du .ombre (147-153). Le chapitre du nom traite de la formation et de la 
dérivation et énumére quelques emprunts (56-79). La flexion possessive est 
renvoyée au chapitre du pronom ow sont énumérés les pronoms proprement dits, 
invariables, et les préfixes personnels utilisés dans la flexion du nom et du verbe 
(80-91). Un chapitre sur les “formes liées”’ (‘bound forms’; 92-118) énumére les 
démonstratifs, les interrogatifs et les verbes qui traduisent nos prépositions. 
Le chapitre du verbe ne traite que des flexions de la racine (119-146). Celui qui 
est consacré aux préfixes traite de la conjugaison en reprenant méme les préfixes 
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personnels déja indiqués (154-292). Le chapitre de la syntaxe indique la valeur 
habituelle ou grammaticale de l’ordre des mots dans la phrase et la fagon dont 
s’exprime |’interrogation et la négation (293-331). Un chapitre sur |’usage et le 
vocabulaire est précieux par les idiomatismes et les particularités sémantiques 
qu’il enseigne (332-368). Enfin le dernier chapitre “Speech Diversity” (369-381) 
présente les variations de prononciation, individuelles ou stylistiques, des mots 
ou des phonémes. Un texte, avec traduction mot-d-mot et commentaire gram- 
matical (382-391), et deux pages de bibliographies terminent le volume. Pour 
les abréviations qui suivent les exemples il faut malheureusement se reporter 
& la page x. 

Cette grammaire si riche et si touffue aurait été éclairée par des tableaux 
d’ensemble (comme celui de la structure du verbe donné par Hoijer dans J AL 
11.196) et par un index des formes, mais c’est le sort des ouvrages trop riches 
de faire craindre au lecteur de ne pouvoir utiliser toute cette richesse. 


Paris AnpréE G. HaupRICOoURT 


BENJAMIN P. LEKENS, Dictionnaire Ngbandi (Ubangi-Congo Belge). Annales 
du Musée Royal du Congo Belge. Sciences de l’Homme: Linguistique, Vol. 1, 
xii + 348 pp. Anvers: Editions de Sikkel, 1952. 

G. Huustarrt, Dictionnaire Francais—Lomongo (Lonkundo). Annales du Musée 
Royal du Congo Belge. Sciences de l’Homme: Linguistique, Vol. 2. xxxii + 466 
pp. Tervuren, 1952. 

These dictionaries are the first two of a series of publications to be continued 
under the auspices of the Commission de Linguistique Africaine (of the Belgian 
Colonial Ministry). In every respect they are of the highest quality—not only in 
their demonstration of excellent linguistic and lexicographical methodology, and 
in their complete yet concise treatment of the material involved, but also, and 
not least of all, in their efficient and most attractive format and in the superb 
quality of their printing, paper, and binding. It is doubly valuable to have works 
of this high caliber available when they are also so easy and pleasant to use. 

Ngbandi, by Greenberg’s classification, is in the Eastern Branch of the Niger 
Congo family. It is spoken along the Ubangi river in the northern Belgian Congo 
and in adjacent parts of French Equatorial Africa. One of its dialects has become 
a widespread lingua franca in parts of French Equatorial Africa even beyond the 
normal boundaries of its speakers, under the name of Commercial Sango. The 
vocabulary and structure of this trade form, with its larger amount of borro.wed 
material, deserves separate treatment. Lekens confines his study to the speech of 
an area in the Belgian Congo which is precisely located on his map. (In a more 
recent and more extensive publication, Lekens collaborates on a fuller treatment 
of the lexicon of the language, including material from other dialects.) 

In a brief introduction, Lekens gives a concise and eminently clear outline of 
his transcription, including tone. The only clear case of nonphonemic transcrip- 
tion is the use of kpm before nasalized vowels and kpw before oral vowels, both 
of which may surely be interpreted as /kp/. For the corresponding voiced 
phoneme, gb is consistently written before all vowels. Mid tone appears to have 
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two allophones conditioned by adjacent pitches: but a few isolated exceptions 
raise the question of a possible contrast between them. In the transcription of 
tone, a suggestion of Westermann’s is adopted: leave initial low tone unmarked, 
and leave unmarked any tone that is the same as a preceding marked tone. This 
is a convenient convention for many languages, but is often adopted uncritically 
in cases where it is by no means the best solution. In many languages, the 
structure suggests quite different conventions for simplifying the transcription. 
In the case of Ngbandi, it seems to work well enough, but warning should be 
given against assuming that it will work in any language. 

Lekens’ dictionary of Ngbandi is two-way: French-Ngbandi and Ngbandi and 
French. Specific semantic ranges of French words are often distinguished by 
Flemish (Dutch) glosses, which are space-saving and can hardly be criticized 
considering the main body of consumers. It would have been preferable, how- 
ever, to place all cases of gb after all cases of g-, rather than between ga and ge; and 
similarly to enter kp separately rather than between ko and ku. Likewise, all 
cases of ¢ after e and of 9 after o would be better than intermingling them. Even 
the unsophisticated user would profit from such reminders of the phonemic 
structure of the language. By and large, however, Lekens has succeeded admir- 
ably in preparing a dictionary that any learner of Ngbandi can profitably use, 
without technical linguistic training, and which at the same time—or more 
likely for this very reason—is a valuable source book for the specialist in lin- 
guistics and African languages. 

The second dictionary is of a Bantu language, Lomongo, spoken in the forest 
area of the great curve of the Congo river, by possibly as many as two million 
speakers. Hulstaert gives not only an outline of his transcription, including tone, 
but also a grammatical sketch. This is designed not simply for abstract infor- 
mation, but for more efficient use of. the dictionary. Someone reading a 
Lomongo text will find it necessary to extract the stem from a complex word be- 
fore looking it up. Even to use this part of the work (only French-Lomongo in 
this volume), the hyphenated entries take on meaning only in the light of the 
grammatical material given in the introduction. Introductions to dictionaries, if 
carefully prepared, are extremely valuable. It is regrettable that so few users of 
dictionaries ever read them. In many cases, as this, a simple introduction makes 
possible a much more efficient lexical treatment in the body of the work. 

Both authors are to be congratulated on their sound and extensive treatment 
of lexical ranges and of phrases involving special usages. These dictionaries are 
far more than word lists. Both of them give an excellent picture of the usage of 
words in the languages concerned, and reflect the range of French usage for 
which equivalents may be sought. Hulstaert in particular outlinesin his introduc- 
tion several types of equivalence patterns, and stressed them within the dic- 
tionary by such devices as listing French nouns (e.g., bonté) but not adjectives 
(e.g., bon includes a reference to bonté and, in addition, only some special phrases). 

It hardly seems necessary to congratulate both authors also on their consistent, 
accurate, and practical notation of tone. Such a procedure should, by now, be 
taken for granted in dealing with African languages. However, there still seem 
to be some would-be linguists who believe that tone is some exotic phenomenon 
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that cannot be properly treated in transcription, nor properly reproduced by 
Europeans. Lekens and Hulstaert treat this attitude for what it is—uninformed 
folklore. It is to be hoped that their example will be an encouragement to others 
to do as careful and commendable work. 

Africanists will anticipate further publications in this series with the highest 
hopes for a continuation of such accurate and excellently presented information 
on more languages of the Congo. 


Cornell University Wa. E. WELMERS 


BENNET E. Mauexesu. Unkhoswe Waanyanja. Edited by Guy Atkins. 124 pp. 

London: Oxford University Press, 1952. 

This is the first of a series of annotated texts in African languages prepared 
under the auspices of the School of Oriental and African languages. The topic in 
this instance is the system of guardianship (uykhoswe), an institution of consider- 
able ethnologic interest and wide repercussions in native life. The 'anguage is 
Southern Nyanja, an important Bantu language of the Lake Nyasa region. The 
value of the present series of texts to the linguist would have been greatly en- 
hanced by an accompanying translation, a glossary, or preferably both. Tones 
are marked in forms cited in the accompanying notes but not in the text itself. 
In addition, both the orthography of the text and the grammatical terminology 
of the notes are only intelligible in the light of Atkins’ other works, his “‘Sug- 
gestions for an Amended Spelling and Word Division of Nyanja’ (Memorandum 
XXV of the International African Institute), and his article, “The Parts of 
Speech in Nyanja” (The Nyasaland Journal, 1950: 7-58). To judge from this 
first sample, the texts which will appear in this series are likely to be of consider- 
able ethnologic and stylistic, as well as purely linguistic interest as instances 
of indigenous African literature. It is to be hoped that subsequent volumes will 
be provided with more adequate aids for those who are not native speakers. The 
ethnologist, at least, deserves a translation. 


Columbia University JoserH H. GREENBERG 


WILHELM Scumipt. Die Tasmanischen Sprachen. Comité International de Lin- 
guistes. Publications d’Enquéte Linguistique V. 521 pp. Utrecht-Anvers: 
Spectrum, 1952. 

The death of the last pure-blooded Tasmanian in 1876 ended for all time the 
possibility of direct study of the Tasmanian languages. What we know is based 
on the work of a handful of investigators, beginning about 1780 with Ander- 
son who accompanied Captain Cook in his voyages of exploration and ending 
with Milligan, who published his results in 1859. Schmidt tells us that the 
present work was finished in all essentials in 1917, but that he awaited the pos- 
sible discovery of new important sources before definitive publications. This hope 
now appears vain. Thanks to the subvention of Unesco and the Deutsche For- 
schungsgemeinschaft, the result of Schmidt’s labors are now available in this 
handsomely printed work. We are all indebted to Father Schmidt for the long 
and patient efforts which went into the production of the present volume. All 
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existing material is recorded by untrained observers, generally in popular 
orthography and rarely with any indication of the intended phonetic values of 
the symbols employed. When we add that one of the most extensive sources, 
Norman, is a hodge-podge of words from various parts of the island, that few of 
the sources give even a vague indication of the locality in which the data were 
gathered and that grammatical information must be gleaned from a small num- 
ber of sentences almost all from our single best source, Milligan, the difficulties 
involved in working with this material can be comprehended. 

The work is divided into three sections. The first discusses the sources, the 
interpretation of the orthographies employed in them and the problem of the 
number of distinct languages, their grouping and their geographical location. 
It also contains a fairly extensive comparative vocabulary arranged according to 
the languages as determined by Schmidt and with the source of each form in- 
dicated. The second section contains a discussion of the phonology of the lan- 
guages, both descriptive and comparative and an exposition of what can be de- 
duced regarding their morphology and syntax. The third part includes an 
alphabetized Tasmanian-German vocabulary in which every existing Tasmanian 
form is recorded. For each set of cognate forms, one occurrence is chosen for a 
general listing of all related forms. Every other occurrence contains a cross- 
reference to the place where it is listed with its cognates. This section runs well 
over 200 pages and contains 1508 etyma many of which, however, only occur in 
a single language. This is followed by a shorter German-Tasmanian vocabulary, 
unfortunately arranged by subject matter. The work concludes with a short list 
of Tasmanian proper names and their interpretation. 

Schmidt’s basic conclusions may be summarized as follows: 1. There are five 
Tasmanian languages. 2. They are all related to each other. 3. The Tasmanian 
languages cannot at present be shown to be related to any other. 4. The five 
languages fall into two groups, an eastern and a western, of these, the eastern 
contains three languages, called here the northeastern, east central and south- 
eastern of which the latter two are particularly close to each other. The western 
group consists of two languages, the western and the northern. With the first 
three of these conclusions, I am in full agreement. The last, which concerns the 
internal grouping of the Tasmanian languages calls for more extended discussion. 

For convenience of reference I retain Schmidt’s numbering of the five languages 
as follows: I Northeast; II East central; III Southeast; IV West; V North. 
Schmidt’s contention, then, is that the fundamental division is between I-II-III, 
the ‘eastern’ languages and IV-V, the ‘western’ languages in which is included 
the western language and the northern language. Within the ‘eastern’ group 
II and III are close together and form a subgroup. 

Schmidt first discusses the problem in terms of vocabulary isoglosses (p. 47-48) 
and comes to the conclusion that II-III-IV are closer to each other than any 
of them is to I or V and that the group II-III-IV is somewhat closer to I than 
it is to V. Examination of the lexical isogloss (and, I may add that of the limited 
number of known grammatical isoglosses) affirms the essential correctness of 
this observation. It will be noted that this conclusion is not in agreement with 
Schmidt’s final results as described above, for in this first formulation IV is 
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aligned with II-III against I and against V whereas in Schmidt’s final conclu” 
sions IV shifts into a grouping with V, and I now joins II-III in an eastern group. 
What is the reason for this shift in conclusions? Schmidt’s own words are as 
follows: (p. 40) Es ergibt sich dass I naher steht zu II, III, (IV) als V es tut. 
(first conclusion). Die Gruppierung verschiebt sich etwas, oder vielmehr sie tritt 
klar hervor wenn wir auch die grammatischen Tatsachen mitheranziehen, etc. 
(italics mine). This is a bit as if one said that French, Italian and Spanish formed 
a group as against German and Dutch and then added that the grouping becomes 
clearer if one adds German to the Romance group and separates it from Dutch. 
What grammatical characteristics are involved? Schmidt cites only one and he 
returns to it again and again as the basis for a sharp eastern-western division. 
The feature in question is the nominal suffix which seems to function as an article 
and which is stated to be -na for I, II and III, -led for IV and the no-doubt 
cognate -liga, -riga for V. What are the facts? One of the sources for language III 
(La Billarditre) regularly has lia which is supposedly characteristic of the 
western group (IV-V). This is explained by the ad hoc assumption of a Misch- 
dialect which has borrowed the form from IV. But even more significantly -na 
occurs occasionally in all our sources for the western languages. Here Schmidt 
himself virtually abandons his own case. (p. 174) ‘Wir werden also genétigt, das 
suffix na als ursprunglich auch in der Westgruppe vorhanden anzunehemen’. 
From all appearances, we have here a typical instance of two variant formations 
in the parent language. As a general methodological procedure we should never 
group languages on the basis of a single isogloss which is contradicted by the 
mass of the evidence. In this case, as we have seen the particular isogloss thus 
honored does not even turn out to have a clear-cut distribution. I tabulated the 
distribution of elements with grammatical functions in the five languages. These 
were twelve in number. There was only a single case where one appears in all 
of Schmidt’s western languages to the exclusion of the eastern, and none in the 
eastern to the exclusion of the western. One item appeared in all five languages. 
Of the remainder no less than six violated the east-west division by appearing in 
some languages in both groups. In general the distributions are like those for the 
lexical isoglosses suggesting a three fold division I, II-III-IV and V with I 
somewhat closer to II-III-IV than to V and II-III particularly close to each 
other! I believe therefore that Schmidt’s eastern-western division is untenable. 

The virtue of Schmidt’s book is clearly shown in the fact that the presentation 
of the material is so complete and clear that it was possible to carry out this 
refutation (if such it is) of one of his conclusions from the book itself without 
referring to the original sources. Schmidt’s work is also of value in demon- 
strating what can be done with even the poorest material by dint of patient and 
intelligent effort. After all, any kind of transcription is evidence that an enquirer 
heard certain things and not others. Other things being equal we greatly prefer a 
phonemicized description. But where it is not available (and for Tasmania this 
will necessarily always be the case) the material is by no means useless for cer- 
tain scientific purposes. 


Columbia University JoserH H. GREENBERG 
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